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"The  Time  may  not  be  very  remote  when 

it  will  be  understood  that  for  complete 

initiation  as  an  efficient  citizen  of 

one  of  the  new  erreat  complex  world  wide 

states  that  are  now  developing,  it  is  as 

necessary  to  be  able  to  compute,  to  think 

in  averages  and  maxima  and  minima,  as  it  is 

now  to  be  able  to  read  and  to  write." 

— H.  G.  wells, 

Mankind  in  the  Making. 
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PREFACE 

The  success  of  statistical  techniques  in  improving  the 
quality  of  manufactured  goods  has  led  to  the  adoption  of  these 
techniques  into  non-manufacturing  fields.   Today  these  highly 
developed  mathematical  techniques  are  providing  case  histories 
of  successes  in  overcoming  problems  concerned  with  the 
improvement  of  goods,  services,  and  performance  through  better 
control  methods,  and  through  analysis  of  data  in  decision 
making.   Statistics  is  commonplace  in  the  fields  of  stock 

market,  advertising,  sales  surveys,  and  accounting.   The  statis- 
tical methods  based  on  the  law  of  probability,  which  were 
successful  in  controlling  war-production  quality,  are  now  being 
applied  to  service,  staff,  and  administrative  activities. 

Although  there  are  many  favorable  reports  and  case 
histories  to  substantiate  the  value  of  the  application  of 
statistical  technique  to  decision  making  there  are  many  who 
do  not  acceot  these  methods.   Statistics  are  "dry"  and  there  is 
mystery  in  figures  and  formula.   There  are  many  who  distrust 
figure  people.   Statistics  is  a  scare  word  and  hence  must  be 
sold  to  be  accepted. 

It  is  the  purpose  of  this  paper  to  provide,  in  as  far 
as  is  possible  in  dealing  with  a  subject  of  this  sort,  a 
nontechnical  discussion  of  these  statistical  techniques.   It 
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is  not  the  purpose  here  to  do  more  than  merely  scratch  the 
surface  of  statistical  theory  to  provide  only  what  is  necessary 
to  demonstrate  the  application  of  statistical  method  in  solving 
rather  routine  problems.   It  is  not  necessary  to  become  a 
skilled  mathematician,  the  technical  phases  of  statistical 
method  can  be  delegated  to  the  technicians — if  it  is  clearly 
understood  at  the  management  level  what  the  objectives  are 
and  if  proper  support  is  given  the  staff.   Although  the 
comptroller  need  not  know  detailed  technical  applications, 
he  should  be  interested  and  have  developed  some  feel  and 
appreciation  for  such  methods.   The  use  of  statistical  and 
mathematical  method  is  in  reality  a  front-office  tool  and  the 
higher  one  ascends  within  the  management  structure  the  more 
inroortant  It  becomes  as  a  management  technique  of  analysis 
and  control. 
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CHAPTER  I 
STATISTICS  AND  THE  COMPTROLLER 

Introduction 
Statistics  is  the  offspring  of  a  varied  ancestry.   As 

a  result  of  greed  the  ancient  kings  enumerated  their  people 
for  the  purpose  of  taxation;  the  panic  of  an  English  Sovereign 
during  the  London  plague;  the  cupidity  of  professional  gamb- 
lers; the  scientific  ardor  of  the  psycho-physicists;  the  labors 
of  mathematicians  and  astronomers  and  physicists;  students  of 
social  phenomena,  biologists,  and  educators  planning  a  new 
science  of  education;  from  these  statistics  has  descended. 

Today  the  field  of  statistics  has  many  followers,  yet 
there  are  many  more  who  feel  that  any  one  who  deals  with 
numbers,  and  hence  statistics,  cannot  be  trusted.   Even  though 
there  is  a  strong  negative  view  toward  the  concepts  of  statis- 
tics, it  has  nevertheless  become  firmly  established  as  an 
Indispensable  tool  of  the  comptroller  and  scientific  manage- 
ment. 

Statistics  has  become  a  basic  tool  of  decision  making 
and  is  a  bssiF  for  many  important  management  functions  such  as 
market  and  scientific  research,  budgeting  and  forecasting, 


Helen  M.  Walker,  Studies  in  the  History  of  Statistical 
Method  (2d  printing;  Baltimore,  Md. :   The  Williams  and 
Wilkins  Company,  January  1931),  p.  1. 
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purchasing,  production  planning,  quality  control,  engineering, 
etc.   Of  these  broad  applications  the  military  comptroller  is 
vitally  interested  in  reliable  methods  and  techniques  for 
budget  prediction  and  forecasting. 

Many  in  the  field  of  comptrollershlp  do  not  understand 
how  this  vital  tool  can  be  used  to  their  advantage;  there  is 
inadequate  understanding  of  this  powerful  tool  even  among  many 
persons  engaged  in  the  fields  where  statistics  has  its  greatest 
application. 

One  might  ask  why  this  new  technique  of  scientific 
management  is  not  better  understood  and  accepted.   Some  educa- 
tors give  as  the  paramount  reason  the  fact  that  statistics  was 
a  rare  course  in  our  colleges  during  the  college  years  of  most 
of  our  key  executives  today.   And  even  today  one  can  still  find 
many  schools  of  business  administration  of  so?i>e  of  our  leading 
colleges  that  do  not  have  an  adequate  statistical  curriculum. 
What  Is  Statistics  and  Why  the  Growing  Interest? 

Most  educators  will  define  statistics  as  that  branch 
of  applied  mathematics  which  deals  with  the  collection,  tabu- 
lation, presenting,  analyzing,  and  interpreting  quantitative 
data. 

There  are  many  reasons  for  the  growing  interest  in 
the  analysis  of  quantitative  data.   One  is  that  the 
tools  of  analysis  have  become  sheroer  as  refinements 
have  been  made  in  statistical  techniques.   But  the 
principal  reasons  for  the  increasing  use  of  these 
methods  are  to  be  found  in  certain  basic  changes 
which  have  taken  place  in  the  American  economy. 

First,  there  has  been  a  rapid  transition  from  the 
day  of  small  local  enterprise  to  an  era  of  centralized 
large-scale  production.   As  the  size  of  the  business 


unit  has  increased ,  the  imoortance  of  executive 
decisions  has  increased.   Today,  great  corporations 
exceed  in  wealth  the  sovereign  states  which  created 
them.   Never  before  have  business  executives  controlled 
so  much  wealth,  nor  have  so  many  persons  been  dependent 
on  a  single  enterprise,  and  never  before  have  mistakes 
been  so  costly  in  terms  of  money  and  human  suffering. 

The  responsibilities  of  the  modern  executive  have 
grown  with  the  scale  of  his  operations,  but  his  ability, 
as  an  individual,  to  reach  proper  decisions  has  not 
kept  pace.   A.  C.  Nielsen,  of  the  A.  C.  Nielsen  Company 
of  Chicago,  is  a  marketing  expert  who  employs  quantitative 
data  in  analyzing  problems  of  marketing  for  some  of 
America's  largest  corporations.   Always,  before  submitting 
an  analysis  of  a  particular  problem  to  the  executive  who 
has  ordered  it,  he  asks  the  executive  what  he  considers 
the  answer  to  be.   He  has  kept  a  record  of  the  replies. 
According  to  Mr.  Nielsen  the  executives  have  been  wrong 
42  per  cent  of  the  time. 

Lacking  objective  statistical  studies  to  guide  them, 
the  executives  would  have  made  almost  as  many  wrong  as 
right  decisions. 

A  second  development  in  the  American  economy  which 
has  stimulated  the  use  of  quantitative  data  is  the  great 
variation  which  has  appeared  in  economic  activity  as 
cyclical  prosperities  and  depressions  have  followed  one 
another  in  rapid  sequence.   One  effect  of  the  apparently 
increasing  instability  of  the  economy  has  been  to  create 
an  urgent  need  in  the  fields  of  business  and  government 
for  a  type  of  analysis  of  economic  conditions  which  nay 
enable  the  executive  to  see  a  short  distance  into  the 
future,  to  detect  the  shape  of  things  to  come  shortly, 
at  least,  before  they  arrive  with  the  devastating  £orce 
of  depression  or  the  buoyant  effect  of  prosperity.2 

There  has  been  a  growing  use  of  statistics  in  govern- 
ment as  well  as  in  business.   As  the  government  continued  to 
grow  in  size  and  to  assume  many  functions  statistical  tech- 
niques were  increasingly  emphasized.   The  complexities  of 
planning  for  the  economy  as  a  whole,  the  obligations  assumed 
by  the  various  federal  bureaus  in  administering  the  affairs  of 


p 

William  Addison  Neiswanger,  Elementary  Statistical 

Methods  (New  York:   Macmillan  Company,  1951),  p.  2. 
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industry  and  agriculture,  and  providing  for  the  national 

security  means  that  decisions  made  by  government  officials  are 

of  great  importance  in  terms  of  money  and  human  welfare.   In 

this  area  it  is  obvious,  mistakes  can  be  most  serious,  and 

decisions  based  on  personal  direct  observation  are  unlikely. 

The  duties  and  responsibilities  of  the  comptroller  of 

the  Navy  stem  directly  from  those  specified  in  Title  IV  of  the 

National  Security  Act  Amendments  of  1949.   Title  IV  states 

among  other  things  that  the  comptroller  shall: 

Establish  and  supervise  principles,  policies,  and 
procedures  of  organizational  and  administrative 
matters  relating  to  the  preparation  of  cost  budgets, 
fiscal  operation,  capital  property,  accounting,  and 
progress  and  statistical  reporting.   (italics  mine.)-5 

The  Secretary  of  the  Navy  in  setting  forth  the  basic 
concepts  of  coraptrollership  in  the  Navy  emphasized  the  statis- 
tical function  by  requiring  the  comptroller  to  be  responsible 
for  "developing  guides  and  criteria  for  the  collection  and 
coordination  of  statistical  data  and  preparing  special  statis- 
tics  as  required.   (italics  mine,; 

The  Controllers  Institute's  Committee  on  Ethics  and 
Eligibility  Standards  emphasizes  statistics  as  a  Comptroller- 
ship  function  and  states  among  other  things  that  the  controller 
is  responsible  for  the  compilation  of  statistical  records  as 
required. 


"u.  S.,  Congress,  National  Security  Act  Amendments  of 
194^,  Public  Law  216,  8lst  Congress,  1st  session,  19W,' 
Title  IV. 

^U.  S.,  Department  of  the  Navy,  Secretary  of  the  Navy 
Instruction  5400.4,  November  18,  1953,  p.  1. 
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At  this  point  it  might  be  well  to  emphasize  this  point 
that  it  is  not  the  purpose  of  this  paper  to  la.ply  that  the 
comptroller  be  a  trained  and  accomplished  statistician.   The 
comptroller  needs  to  make  decisions  rapidly,  based  upon  timely 
and  accurate  data.   It  is  only  through  the  modern  methods  of 
statistics  and  applied  mathematics  that  such  ■  desire  for  data 
collection  can  be  realized;  thus  it  becomes  important  that  the 
military  comptroller  while  essentially  a  generalist  and  not  a 
specialist,  at  least  among  line  officers,  become  thoroughly 
familiar  with  the  capabilities  and  relationships  of  statistic- 
ians and  statistics  in  the  broad  field  of  management. 

Management  is  concerned  with  what  must  be  done,  thus 
it  becomes  necessary  for  the  manager  to  make  some  sort  of  fore- 
cast of  what  events  are  likely  to  happen.   Often  this  is 
accomplished  by  using  past  experience  as  a  guide,  or  his 
intuition  but,  although  these  can  at  times  be  good  guides,  they 
can  also  be  disastrous  mistakes  which  could  have  been  avoided 
were  more  accurate  and  timely  data  available  and  effectively 
used. 

To  be  effective  management  needs  to  know  such  things 
as  the  relationship  existing  between  ore  set  of  events  and 
another,  and  the  trends  established  by  these  events,  nnd  causes 
of  these  events  in  order  to  add  a  degree  of  reliability  to  its 
forecasts.   Usually  no  invariable  relationships  of  cause  and 
effect  can  be  found  in  the  past  or  predicted  of  the  future  and 
so  the  manager  has  to  deal  in  the  realm  of  probabilities.   This 
he  is  by  training  ured  to  doing  as  a  result  of  mostly  dealing 


6 
with  human  relationship  and  individual  persons,  whose 
individual  actions  cannot  be  predicted  with  any  degree  of 
certainty. 

There  are  still  those  in  the  field  of  management, 
however,  who  are  afraid  of  figures.   They  feel  that  they  can 
see  and  hear  what  is  going  on  and  do  not  need  figures  to  tell 
them.   But  it  is  impossible  to  be  everywhere  at  the  same  time 
and  reliance  must  be  placed,  on  occasion,  on  the  observation 
and  judgment  of  others — within  these  observations  is  necessari- 
ly the  element  of  measurement  and  measurement  cannot  be  possible 
without  the  use  of  numbers. 

The  techniques  of  drawing  conclusions  from  numbers  is 
where  misunderstanding  and  harm  are  done.   Intuition  can  pro- 
vide for  better  conclusions  than  those  of  the  man  who  loves 
figures,  but  does  not  have  the  special  knowledge  of  how  to  deal 
with  them,  and  works  out  percentages  and  averages  and  then 
draws  unjustifiable  conclusions  from  them.-5 

It  is  still  too  little  realized  that  there  exists 
a  technique  of  dealing  with  figures  in  such  a  way  as 
to  find  out  with  a  very  high  degree  of  probability 
what  sort  of  conclusions,  if  any,  can  validly  be 
drawn  from  certain  sets  of  figures  with  what  degree 
of  accuracy  they  can  be  drawn  and  to  present  those 
conclusions  in  a  simple  form.   In  this  way  not  only 
is  it  possible  to  avoid  drawing  wholly  unwarranted 
conclusions,  but  also  at  time  to  make  some  sense  of 
a  set  of  figures  which  at  first  sight  may  aopear  to 
tell  nothing. 6 

The  Report  of  a  Committee,  for  Submission  to  the 
Twenty-Fifth  Anniversary  Conference  of  Management  Research 
Groups  at  Buxton,  1-4  June,  1951  (London:   Management  Research 
G-rbups,  Zanfield  House) ,  p.  o. 

6Ibid. 
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These  techniques  of  dealing  with  figures  is  the 
service  that  the  trained  statistician  specializing  in  mathe- 
matical analysis  can  give  to  the  comptroller  or  manager,  and 
in  subsequent  chapters  of  this  paper  examples  of  these 
techniques  will  be  presented. 

The  basis  of  all  statistics  is  counting  or  measurement 
and  the  statistician  specializes  in  methods  of  counting 
or  measurement,  whether  by  so-called   complete  counts, 
or  censuses,  or  by  counts  or  measurements  of  a  sample  of 
the  whole,  from  which  the  totality  can  be  estimated.   To 
this  end  he  uses  a  snecial  set  of  applied  mathematical 
techniques  based  on  the  theory  of  probability. 

Statistical  studies  are  of  two  classes,  enumerative 
and  analytical.   Enumerative  study  determines  how  many 
of  some  classes  of  things  exist  at  a  particular  time, 
but  is  not  concerned  with  why  they  exist.   An  analytical 
study  investigates  oatterns  and  variations  either  in  one 
series  of  events  over  a  oeriod  of  time,  or  between  differ- 
ent groups  of  things.   It  may  or  may  not  be  possible  as 
a  result  to  make  some  assessment  of  causes,  but  if  the 
results  of  the  study  are  at  all  positive  then  seme 
causal  relationship  is  assumed  to  exist.   The  routine 
calculation  of  enumerative  data  often  forms  the  has if 
for  analytical  studies.' 

It  is  important  at  this  point  that  the  statistician 
and  the  accountant  not  be  confused--that  a  distinction  be  made 
between  the  two  arts.   In  accounting  the  individual  account  or 
entry  is  of  chief  concern.   It  must  be  accurate  and  reliable 
and  subject  to  audit.   In  statistics  It  is  not  necessary  to 
have  accounting  accuracy  for  each  individual  item,  but  rather 
estimates  based  on  sampling  is  the  order  of  the  day.   Mathe- 
matical accuracy  is  still  required  in  statistical  work  and  it 
is  not  intended  to  imply  otherwise,  however,  some  sacrifice 
of  reliability  can  be  made  of  the  complete  count  for  a 
sampling  study  expertly  done. 

7Ibid. 
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The  unit  loses  its  Identity  in  a  statistical  measure 
such  as  an  average  or  a  total.   It  is  the  measure  or  total 
and  the  picture  portrayed  by  these  measures  from  which  deci- 
sions are  based  and  not  from  each  individual  unit.   The 
statistician  may  say  that  a  certain  fact  exists  within  a 
specified  range  such  as  the  Inventory  of  a  specific  item  is 
not  more  than  1050  items,  for  example,  nor  less  than  950  items. 
The  accountant  would  be  required  to  produce  a  complete  count 
capable  of  audit. 

Conclusion 

In  conclusion  it  is  well  at  this  point  to  summarize 
this  discussion  and  to  point  out  some  valid  reasons  why  the 
comptroller  and  management  should  acquire  statistical  know- 
ledge, and  the  relationship  between  management,  the  comptroller 
and  the  statistician. 

I  think  it  is  generally  understood  that  the  comptroller 
is  to  operate  in  a  staff  capacity,  however,  he  may  recommend 
to  management,  but  he  does  not  make  managements  decisions. 
Management  looks  upon  the  comptroller  as  a  continuing  source  of 
ideas  and  advice  on  managerial  problems.   The  comptroller  must 
have  had  broad  experience  with  operating  programs  and  problems 
and  general  management  responsibilities.   He  should  have  a  keen 
analytical  ability  and  be  capable  of  making  discriminating 
Judgments. 

Since  statistical  methods  are  capable  of  application 
to  all  of  the  activities  of  an  organization,  and  are  a  service 
function  it  is  fitting  then  that  this  function  should  come 
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under  control  of  the  comptroller  or  financial  director,  and 
this  it  does  in  most  cases. 

There  are  many  valid  reasons  for  management  to  under- 
stand statistics.   Such  an  understanding  of  course  would 
promote  a  wider  use  of  statistical  data  in  management  since 
problems  where  statistics  can  be  profitable  would  be  quickly 
recognized. 

Communication  with  the  professional  statistician 
would  be  improved  by  a  better  understanding  of  the  art  by 
management.   Albert  Einstein  once  said,  "The  formulation  of 
the  problem  is  often  more  essential  than  the  solution."   With 
a  statistical  background  management  can  understand  the  problems 
involved  in  fact-finding  and  thus  can  avoid  misunderstanding 
in  research  projects  conducted  in  the  organization  and  assure 
effective  use  of  the  complied  data. 

Management  must  understand  that  the  science  of 
statistics  is  not  infallible  and  that  there  is  risk  involved 
in  decision  making  based  on  statistical  conclusions.  Most 
statistical  decisions  are  based  on  samples  and  this  involves 
risk.  A  statistical  minded  management  should  understand  the 
"risk"  Involved  and  the  "confidence"  in  statistical  decision 
making. 

Of  course  a  better  understanding  of  statistical 
methods  serves  to  emphasize  that  good  effective  statistical 
studies  require  a  trained  staff  of  professional  statisticians. 
This  will  yield  valid  results  with  a  minimum  of  cost. 
Statistical  functions  cannot  be  assigned  to  the  budget  analyst, 
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clerk  or  bookkeeper  in  the  office  or  the  foreman  or 
technician  In  the  plant. 

And  finally  management  will  become  a  more  confident 
administrator  and  enjoy  peace  of  mind,  insofar  as  administra- 
tive matters  are  concerned,  by  knowing  the  basic  elements  of 
statistics. 

More  important  he  will  make  less  mistakes  and  will 
be  better  a.ble  to  control  the  mistakes  of  others  in  his 
organization. 


CHAPTER  II 
HISTORY  OF  STATISTICAL  METHOD 
The  Normal  Curve 

Origin  In  the  Theory  of  Probability 

The  history  and  development  of  statistical  method  Is 
a  colorful  and  Interesting  study.   The  statistical  methods  of 
today  are  based  on  the  science  of  probability.   The  early 
scientists  and  mathematicians  were  aware  that  certain  natural 
activities  occur  In  conformity.   The  early  observations  of 
Aristotle  and  astronomers  in  establishing  fundamental  physical 
laws  of  the  universe,  regimented  certain  observations  in  the 
field  of  science.   The  ability  to  predict  happenings  and 
behavior  on  the  basis  of  new  found  laws  provided  the  author- 
itative position  of  charlatans,  mystics,  magicians  and  gamblers. 
Of  course  at  this  stage  of  development  it  was  Impossible  to 
predict  the  behavior  of  certain  factors  which  were  caused  by 
chance  alone,  however,  this  did  not  discourage  or  weaken  the 
position  of  these  early  philosophers  and  scientists.   The  real 
beginning  of  the  science  of  probability  did  not  occur  until 
the  early  part  of  the  17th  century.    The  exact  date  for  the 

Robert  Kirk  Mueller,  Effective  Management  Through 
Probability  Controls  (New  York:   Funk  and  Wagnalls,  1950), 
p.  62. 
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discovery  of  the  normal  curve  (fig.  1  ch.  Ill)  which  had  its 
beginning  and  discovery  from  the  theory  of  probability  was 
November  12,  1733. 

There  were  some  faint  traces,  of  no  more  than  six  or 
eight  problems,  that  intimated  the  idea  of  probability  before 
the  year  1600.   These  traces  appeared  in  the  Orient.   One 
example  occurred  when  the  Chinese  writer  Yuan  Yuan  criticized 
a  still  older  writer,  Sun-Tze  for  the  following  reason; 
"Many  unnecessary  details  appear  in  his  works  on  mathematics, 
such  as  a  certain  absurd  oroblem,  which  surely  cannot  be 
attributed  to  him,  on  the  probability  that  an  expected  child 
will  turn  out  to  be  a  boy  or  to  be  a  girl."2 

In  Europe  the  really  first  references  to  the  theory  of 
probability  is  a  statement  in  a  commentary  (Venice,  1477)  on 
Dante's  Divine  Comedy  concerning  the  different  throws  which 
can  be  made  with  three  dice.   The  first  writer  to  discuss  a 
problem  concerning  gambling  in  a  mathematical  paper  was  Luca 
Pacioli  in  his  Suma  (1494).   In  this  paper  he  gave  the  first 
version  of  the  problem  concerning  the  equitable  division  of  the 
stakes  between  two  players  of  unequal  skill  when  the  game  is 
interrupted  before  its  conclusion.   This  problem  was  repeated 
and  amplified  in  all  the  works  on  probability  for  two  hundred 
years.  Astronomers  played  a  conspicuous  role  in  developing  the 
mathematical  theory  of  probability  two  centuries  later — it  is 
worthy  of  note  that  both  Kepler  and  Galileo  made  brief 


2Helen  M.  Walker,  Studies  in  the  History  of  Statistical 
Method  (Baltimore,  Md. :   The  Williams  and  Wilkins  Company, 
January  1931) %   p.  1,  ff. 
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reference  to  the  subject  of  chance. 

The  first  real  scientific  works  on  probability  began 
in  the  17th  century.   The  foundations  were  laid  by  the  two 
most  distinguished  mathematicians  of  that  time,  Blaise  Pascal 
and  Pierre  de  Fermat.   The  theory  develooed  when  the  Chevalier 
de  Mere,  a  gambler  having  unusual  ability,  even  for  mathematics, 
proposed  certain  questions  to  Pascal.  Among  these  questions 
was  the  celebrated  "Problem  of  Points''  concerning  the  division 
of  stak.es  between  two  players  who  separate  without  completing 
their  game.   Pascal  and  Fermat  exchanged  numerous  letters  on 
this  subject  during  the  year  1654,  snd  la  the  course  of  this 
correspondence  they  generalized  the  problem  more  and  more  until 
at  its  close,  that  which  had  first  been  a  source  of  perplexity 
to  a  gambler  had  been  elevated  to  an  important  mathematical 
concept. 

Within  the  next  few  years  several  problems  relative  to 
the  outcome  of  games  of  chance  were  proposed  and  solved  in  the 
journals  of  various  learned  societies  by  Jacques  Bernaulli, 
Montmort,  De  ^oivre,  Arbuthnot,  Francis  Roberts  and  others. 
The  Discovery  of  the  Normal  Curve 

The  normal  curve-7  was  discovered  by  Abraham  De  Moivre 
(1667-1754)  as  a  result  of  his  intensive  study  to  his  vork  and 


3 
The  Normal  Curve:  A  symmetrical  bell  shaped-curve 

possessing  certs  in  statistical  characteristics.   See 

discussion  Chapter  III. 
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his  applications  of  the  binomial  theorem  led  him  to  discover 
a  formula  for  the  ratio  between  the  middle  term  and  the  sum  of 
all  the  terms  of  (l+l)n.   Many  of  the  recent  treatises  on 
probability  and  sampling  approach  the  matter  by  a  method 
similar  to  De  Moivre's  use  of  the  binomial  expansion. 

This  formula  was  first  published  November  12,  1733. 
In  1730,  De  tfoivre  introduced  his  Miscellanea  Analytlca  and 
three  years  later  he  presented  privately  to  a  few  friends  a 
short  seven  page  paper  entitled  Approxlmatis  ad  Summan  Termln- 
orum  Blnomle  a+bn  in  Seriem  Expansl,   In  this  obscure  treatise 
on  abstract  mathematics,  written  in  Latin,  two  centuries  ago, 
and  believed  by  De  I  oivre  to  have  no  practical  implications 
outside  the  realm  of  games  of  chance,  we  have  the  first  formu- 
lation of  the  concept  of  the  law  of  errors.   He  did  not 
realize  that  there  would  come  a  time  when  this  theorem  would 
powerfully  affect  the  thinking  of  the  world  on  all  its  social 
problems,  shape  the  policy  of  educators  in  the  schools,  and 
aid  in  thousands  of  investigations  in  science  whose  very  names 
were  then  unknown. 

Later  De  -loivre  announced  the  formula  for  the  curve  of 
error,  and  connected  his  theory  of  probability  with  theology, 

A 

Binomial  Theorem:   The  theorem  by  means  of  which  a 

binomial  may  be  raised  to  any  power  without  performing  the 
multiplications.   A  binomial  is  an  expression  consisting  of 
two  terms  connected  by  a  plus  or  a  minus  sign.   A  binomial 
distribution  is  a  distribution  where  there  are  only  two 
alternatives.   For  example  in  sampling  light  bulbs,  they  are 
either  good  or  bad. 


power. 


5 
Binomial  Expansion:   Raising  the  binomial  to  a  given 
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urging  that  this  tendency  of  events  to  conform  to  law  argues 

a  Jreat  First  Cause. 

And  thus  in  all  cases  it  will  be  found,  that  although 
chance  produces  irregularities,  still  the  odds  will  be 
infinitely  great, that  in  the  process  of  time,  those 
irregularities  will  bear  no  proportion  to  the  recurrency 
of  that  order  which  naturally  results  from  Original 
Design.  .  .  .   Again,  as  it  is  thus  demonstrable  that 
there  are,  in  the  constitution  of  things,  certain  laws 
according  to  which  events  happen,  it  is  no  less  evident 
from  observation,  that  these  laws  serve  to  wise,  useful 
and  beneficient  purposes,  to  preserve  the  stedfast 
Order  of  the  Universe,  to  propagate  the  several  Species 
of  Beings,  and  furnish  to  the  sentient  kind  such  degrees 
of  happiness  as  are  suited  to  their  state." 

In  these  later  writings,  De  Moivre  is  eager  to  free 
the  subject  of  probability  from  its  connection  with  gambling, 
and  establish  a  theological  doctrine  of  a  divine  order  working 
through  human  affairs  and  exhibiting  itself  in  the  regularity 
of  statistical  ratios. 

One  of  the  first  attempts  to  reduce  the  theory  of 
probability  to  rules  of  thumb  which  might  be  followed  by  per- 
sons having  no  command  of  mathematics  beyond  simple  arithmetic 
was  made  by  Augustus  De  Morgan  (1806-1871)  in  An  Essay  on 
Probabilities  and  on  Their  Application  to  Life  Contingencies 
and  Insurance  Offices.   De  Morgan  was  brilliant  and  somewhat 
erratic;  he  wrote  on  a  wide  range  of  mathematical  topics.   He 
was  the  greatest  English  authority  of  his  day  on  the  theory 
of  probability.   He  stated  in  his  Essay  on  Probabilities  that 
heretofore  the  work  in  probabilities  has  been  possible  only  to 
the  expert  mathematician,  but  that 

Ibid.,  p.  17,  ff. 
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those  who  already  admit  that  the  theory  of  probabilities 
is  a  desirable  study,  must  of  course  allow  that  persons 
who  cannot  pay  much  attention  to  mathematics,  are 
benefited  by  the  possession  of  rules  which  will  enable 
them  to  obtain  at  least  the  results  of  complicated 
problems,  and  which  will  therefore,  permit  them  to 
extend  their  inquiries  further  than  a  few  simple  cases 
connected  with  gambling. 

His  book  was  really  a  handbook,  full  of  practical  advice  and 

sufficient  practical  rules  for  the  treatment  of  a  set  of 

observations.   He  advised  against  unwarranted  conclusions  from 

a  situation. 

Later  De  Morgan  wrote  other  essays  on  the  Theory  of 

Probability  and  in  these  he  discussed  the  theory  of  least 

squares, '  the  application  of  the  calculus  of  definite 

integrals,  and  some  complicated  problems  in  probability.   One 

portion  of  his  essay  is  of  particular  interest  because  it 

indicates  the  direction  and  thinking  of  scientific  men  of  that 

day.   This  particular  section  discusses  the  probabilities  of 

testimony,  of  miracles,  of  moral  questions,  of  decisions  of  a 

Jury.   He  went  so  far  as  to  suggest  that  an  Investigation  into 

the  validity  of  the  English  practice  of  requiring  a  unanimous 

decision  from  a  Jury  might  be  made  by  studying  five  hundred 

trials  in  which  the  Jury  have  delivered  their  verdict  at  once 

to  see  if  they  show  a  smaller  percentage  of  error  than  five 

7 

Mathematically  computed  lines  may  be  passed  through 

the  data,  or  scatter  points,  figures  2,   J>,    4,  5»  chapter  III 
which  are  called  regression  lines,  or  lines  of  average  relation- 
ship, because  they  reveal  the  typical  change  in  the  dependent 
variable  Y  which  has,  in  the  past,  accompanied  a  given  change 
in  the  variable  plotted  on  the  X  axis.   This  average  relation- 
ship may  be  determined  mathematically  by  the  method  of 
least  squares. 
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hundred  trials  in  which  the  jury  deliberated  for  two  hours  or 
more.   Other  writers  later  attempted  to  assign  numerical 
values  to  the  probabilities  of  the  accuracy  and  sincerity  of 
witnesses,  and.  by  elaborate  mathematical  processes  attempted 
to  reach  numerical  measures  of  the  probability  that  a  given 
defendant  is  guilty  or  innocent. 

Moments 

Professor  Karl  Pearson  is  generally  regarded  as  having 
accomplished  the  most  significant  work  in  the  development  of 
moments.   The  analogy  between  the  statistical  function  obtained 
by  taking  the  sum  of  the  nth  powers  of  the  deviations  in  any 
given  distribution,  and  the  mechanical  concept  of  a  moment, 
had  been  noticed  by  other  writers  before  Pearson,  but  none  of 
them  had  perceived  in  that  function  the  powerful  tool  which 
he  developed.0 

Pearson  studied  extensively  the  effect  of  sampling 
upon  the  moments  of  a  frequency  distribution.   Pearson  and  his 
associates  set  down  the  general  method  by  which  the  sampling 
error  of  any  moment  might  be  studied.   Among  the  formulas 
derived  were  for  the  standard  deviation,"  the  standard  deviation 
of  a  coefficient  of  correlation,  °  the  correlation  between 


8Ibid..  p.  76,  ff. 

9 
Standard  deviation — the  most  reliable  measure  of 

variation.   Often  referred  to  as  the  root-mean-square. 

Correlation  coefficient — an  index  measuring  the  extent 
of  relationship  between  two  variables.   It  ranges  from  -1 
(a  perfect  negative  relationship)  to  1  (a  perfect  positive 
relationship) . 
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sampling  errors  in  the  standard  deviations  of  two  correlated 
traits,  and  the  correlation  between  sampling  errors  in  a 
coefficient  of  correlation  and  in  a  standard  deviation.   The 
moments  are  used  in  curve  fitting  and  from  the  moments  I  re- 
derived  the  parameters**  in  the  Pearson  system  of  curves. 

Percentiles 
The  discovery  of  the  importance  of  the  median  as  a 
measure  of  central  tendency  was  made  about  seventy  five  years 
ago.   There  were,  however,  some  earlier  writers  who  had 

conceived  this  measure  but  v/ho  did  not  place  great  emohasis 

12 
upon  it. 

In  1816  Gauss  suggestel  a  simple  iiethod  of  computing 
the  probable  error.   He  said  "let  all  the  errors  be  arranged 
in  order  of  size  without  regard  to  sign,  and  take  the  middle 
one  if  the  number  is  odd,  or  the  mean  of  the  two  middlemost  if 
the  number  is  even.''   This  value,  which  he  called  merely  M, 
is  thus  the  median  of  the  absolute  value  of  the  errors,  and 
similar  to,  though  not  necessarily  identical  with  Q-alton's 
quartlle  deviation. 

Eneke,  Quetelet,  Fechner,  and  Galton  made  similar 
studies.   Sir  Francis  Gal ton's,  (1822-1911),  first  statistical 
work  was  a  study  of  Hereditary  Genius  (1869)  which  led  him  to 
feel  the  need  for  a  satisfactory  method  of  reporting  differences 
in  achievement  and  intellectual  ability.   The  following  are 


Parameter — a  characteristic  in  a  population.   For 
example,  the  true  proportion  of  defectives  in  a  lot  of 
50,000  light  bulbs. 

12Ibid.,  p.  83,  ff. 
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some  ouotes  from  hie  book: 

The  theory  of  hereditary  genius,  though  usually- 
scouted,  has  been  advocated  by  a  few  writers  in  the 
past  as  well  at  in  modern  times.    it  I  may  claim  to 
be  the  first  to  treat  the  subject  in  a  statistical 
Banner,  to  arrive  at  numerical  results,  and  to  introduce 
the  "law  of  deviation  from  the  average  into  discussions 
on  heredity.  .  .  .   The  range  of  mental  powers  between 
.  .  .  the  greatest  and  least  of  English  intellects, 
is  eLormous.   There  is  a  continuity  of  natural  ability 
reaching  from  one  knows  not  what  height,  and  descending 
to  one  can  hardly  say  what  depth.  ...   I  propose  in 
this  chapter  to  range  men  according  to  their  natural 
abilities,  putting  them  into  classes  separated  by  equal 
degrees  of  merit,  and  to  show  the  relative  number  of 
individuals  included  in  the  several  classes.  .  .  . 
The  method  I  shall  employ  for  discovering  all  this, 
is  en  gpplicatlon  of  the  very  curious  theoretical 
law  of  "deviation  from  and  average. r  First  I  will 
explain  the  law  and  then  I  will  show  that  the  productions 
of  natural  intellectual  gifts  come  Justly  within  its 
scope.  .  .  .   Here  we  arrive  at  the  undesirable,  but 
unexpected  conclusion,  that  eminently  gifted  men  are 
raised  as  much  above  mediocrity  as  idiots  are  depressed 
below  it;  a  fact  that  is  calculated  to  considerably 
enlarge  our  ideas  of  the  enormous  differences  of 
intellectual  gifts  between  man  and  man. 13 

The  idea  of  grades  has  its  first  published  statement 

in  this  book  Galton  rlaced  the  ablest  men  in  each  million  in 

the  highest  grade,  and  the  most  stupid  in  the  lowest  grade,  and 

then  divided  the  remaining  99,998  into  14  classes,  the  average 

ability  of  each  being  separated  from  that  of  its  neighbors  by 

equal  grades,  thus  forming  a  table  to  be  applied  to  special  or 

general  ability.   It  would  be  true  for  any  type  of  examination 

whether  held  in  painting,  in  music,  or  in  statesmanship.   The 

proportion  between  the  different  classes  would  be  identical  in 

all  these  cases,  although  the  claspes  are  made  up  of  different 

individuals.   It  is  clear  that  Gal ton  was  moving  rapidly  in 

the  direction  of  a  percentile  scale,  although  even  the  concept 


13 Ibid.,  p.  87,  ff. 
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of  the  median  Is  not  explicit  as  yet. 

0-alton  later  developed  percentile  scales  in  assigning 
marks  of  excellence  on  test  scores  he  arranged  all  the  members 
of  the  prroun  in  an  order  of  merit.   One  of  Walton's  earliest 
attempts  to  do  this  scientifically  resulted  in  a  "scale  of 
merit"  for  the  men  who  obtain  mathematical  honors  at  Cambridge. 

Galton's  general  method  of  grades  and  deviates  is 
widely  used  in  ^psi^ninp-  class  mar'^s ,  and  in  constructing 
educational  tests  and  scales.   It  is  fundamentally  the  method 
employed  for  the  construction  of  a  T-sc^le.^ 

Correlation 

It  would  be  rather  difficult  to  condense  a  complete 
history  of  correlation  in  this  rarer.   It  is  the  purpose  here 
to  nresent  only  a  brief  account  of  the  development  of 
correlation. 

Correlation  theory  developed  historically  from  the 
theory  of  Probability,  and  it  is  generally  understood  that 
correlation  was  the  unique  discovery  of  Sir  Francis  G-alton, 
made  early  in  the  last  quarter  of  the  nineteenth  century,  and 
greatly  elaborated  and  refined  by  Pearson,  Edgeworth,  and 
Weldon.  Hany  mathematicians  came  close  to  the  discovery  of 
correlation.   Several  French  and  German  astronomers,  physicists 
and  mathematicians  approached  the  problem  from  the  point  of 
view  of  mathematical  analysis,  but  failed  to  see  the  practical 


14 

theoretical  dlstributior  that  goes  by  the  name  of 

Student-T  distribution,  developed  by  W.  S.  G-osset  who 
publ i s bed  "hXs"  worTi  und er  the  pen  name  of  Student. 
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significance  of  the  formulae  they  derived.  It  has  been  paid 
that  this  may  be  at  a  result  of  too  much  attention  to  theory 
and  not  enough  toward  errnirical  data,** 

In  hie  Memories .  Gaiton  tel!!s  of  the  first  time  he 
recognized  that  regression  would  be  the  key  to  the  problem  of 
heredity  (the  concept  of  correlation  developed  from  repres- 
sion).  Galton  discovered  that  the  lavs  of  heredity  were 
solely  concerned  v/ith  deviations  expressed  in  statistical 
unite. 

Galton's  earliest  studies  on  regression  were  in  1375 

when  he  was  conducting  experiments  with  sweet-pea  seeds  to 

determine  the  law  of  inheritance  of  size.   The  question  at 

this  time  in  Galton's  mind,  was:   "How  is  it  possible  for  a 

whole  population  to  remain  alike  in  its  features,  as  a  whole, 

during  many  successive  generation  if  the  average  produce  of 

each  couole  resemble  their  narente?"   From  the  crude  data 

obtained  and  assembled  the  lav/  of  regression;  the  formula  for 

the  standard  error  of  the  estimate,10  and  the  idea  that  the 

various  arrays  in  the  correlation  table  had  equal  variability. 

In  studying  the  inheritance  of  stature,  Galton 
first  transmuted  all  female  statures  to  male,  multiplying 
them  by  the  constant  1.08  (this  had  the  effect  of 
equating  means,  but  did  not  take  into  account  the 
different  variability  in  male  and  female  statures). 
The  average  of  the  statures  of  the  parents  was  then 


15 lb id.,  p.  105,  ff. 
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Standard  error  of  estlmate--a  measure  of  the  scatter 
about  the  line  of  regression.   The  standard  deviation  of  the 
observed  points  about  the  line. 
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termed  the  *mld-t>arent:  *  the  average  of  the  statures 
of  the  offspring,  the  "mid -Fraternity. *   From  these 
he  made  "Tables  of  stature,"  later  called  correlation 
tables.   For  each  array  and  for  the  table  as  a  whole 
he  found  the  median.   Ke  also  found  Q  (semi-interquartile 
range) 17  used  as  an  approximate  value  of  the  probable 
error,  for  the  genera]  population,  for  the  mid  parents, 
and  for  each  *co-Fraternity, *  a  *  co-Fraternity*  being 
all  of  the  adult  sons  and  transmuted  daughters  of  a 
group  of  Mid-Parents  who  have  the  same  stature  (reckoned 
to  the  nearest  inch).   examining  these  values  of  Q  for 
the  various  Co-Fraternities,  he  then  found  them  roughly 
the  same,  regardless  of  whether  the  mid-parents  were  tall 
or  short.   Examining  the  medians  he  found  that  ^However 
paradoxical  it  may  appear  at  first  sight,  it  is  theoretical- 
ly a  necessary  fact,  and  one  that  is  clearly  confirmed  by 
observation,  that  the  Stature  of  the  adult  off soring 
must  on  the  whole  be  more  mediocre  than  the  statures  of 
their  parents.*   As  the  medians  of  the  rows  and  columns 
proved  to  lie  on  approximately  straight  lines,  he  called 
the  latter,  lines  of  regression. 18 

Because  Galton  used  the  variability  in  his  case  Q  as  the  unit 

in  which  he  expressed  deviations,  these  lines  of  regression 

were  symmetrically  placed.   In  187?  he  called  the  slope  of 

these  lines  r  for  reversion.   In  his  paper  "Regression  Towards 

Mediocrity  in  Hereditary  Stature"  (1886)  he  used  W,  and  N 

"Correlations  and  their  Measurement"  (1838)  he  returned  to  r, 

which  now  stood  for  regression. 

Galton  defines  correlation  as  follows. 

Two  variable  organs  are  said  to  be  co-related  when 
the  variation  of  one  is  accompanied  on  the  average  by 
more  or  less  variation  the  other,  and  in  the  same 
direction.  .  .  .   The  statures  of  kinsman  are  co- 
related  variables;  thus  the  stature  of  the  father  is 
correlated  to  that  of  the  adult  son,  and  the  stature 
of  the  adult  son  to  that  of  the  father;  the  stature  of 
the  uncle  to  that  of  the  adult  nephew,  and  the  stature 


' Inter-quartile  range — the  middle  50#  of  the 
observations. 

18 

Ibid.,  p.  106,  ff. 
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19 
of  the  adult  nephew  to  that  of  the  uncle,  and  so  on.  w 

Operations  Research 

To  give  a  complete  account  of  the  development  of  the 
technique  of  operations  research  would  require  several  volumes, 
far  more  than  the  few  paragraphs  that  are  allotted  to  the 
subject  In  this  paper.   Ever  since  about  the  third  century 
B.  C.  political  and  military  leaders  have  consulted  scientists 
for  solutions  to  the  problems  of  war.   There  are  many  instances 
on  record  where  these  non  military  specialists  have  been  able 
to  assist  the  military  with  new  ideas  about  the  machines, 
tactics  and,  strategies  of  war. 

Operations  Research  began  to  produce  more  and  more  case 
histories  about  the  time  of  World  War  I  as  both  England  and  the 
United  States  attempted  to  analyze  military  operations  by 
using  mathematical  methods.   F.  W.  Lanchester  of  England 
produced  some  papers  in  1914  on  the  relationships  between 
victory,  numerical  superiority  and  the  superiority  of  fire 
power.   In  America  Thomas  Edison  made  studies  of  antisubmarine 
warfare  for  the  Naval  consulting  board  which  included  the 
compilation  of  statistics  to  be  used  in  determining  the  best 
methods  for  evading  and  for  destroying  submarines. 

By  1939  at  the  outbreak  of  World  War  II  in  Europe  there 
was  the  nucleus  of  a  British  operational  research  organization 

19Ibld..  p.  106. 
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Joseph  F.  McCloskey  and  Florence  N.  Trefethen  (ed.), 

Operations  Research  for  Management  (Baltimore,  Md. :   John 

Hopkins  Press,  1954),  p.  4,  ff. 
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already  in  existence.   Some  of  the  problems  under  study  at 
this  time  were  to  assist  military  personnel  with  radiolocation 
and  the  integrating  of  the  newly  developing  radar  system  of 
early  warning  against  enemy  air  attack  with  the  older  system 
of  operational  control  based  principally  on  the  observer  corps, 
whose  members  were  trained  in  the  sighting,  identification, 
and  reporting  of  planes.   Other  phases  of  operations  research 
consisted  of  an  analysis  of  all  phases  of  German  air  activity 
during  night  operations  over  Britain;  problems  concerning  the 
detection  of  ships  and  submarines  by  the  use  of  radar  equip- 
ment in  airplanes,  and  all  phases  of  antisubmarine  warfare. 

Studies  of  the  data  obtained  from  the  systematic 
bombing  raids  by  the  Allies  over  Germany  indicated  that  larger 
plane  formations  suffered  smaller  percentage  losses.   These 
findings  resulted  in  the  first  1000-plane  R.  A.  F.  raid  over 
Germany.   In  the  United  States  the  Army  Air  Force  and  the  Navy 
began  work  in  the  field  of  operations  research  in  1940  after 
Dr.  James  B.  Connant,  then  chairman  of  the  National  Defense 
Research  Committee,  visited  England  and  became  aware  of  the 
techniques  of  operations  research.   As  in  Britain  early  opera- 
tions research  for  the  Air  Force  grew  up  around  problems  arising 
from  new  radar  equipment.   Toward  the  end  of  the  war  in  the 
Pacific,  the  U.  S.  Navy's  operational  research  team  undertook 
a  crash  project  in  operations  research  as  a  result  of  the 
onslaught  of  kamikaze  attack  against  Allied  ships.   The  question 
to  be  answered  was:   ''Should  the  ship  under  attack  maneuver 
violently  to  avoid  being  hit  or  keep  straight  in  order  to  get 
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better  aim  with  its  anti-aircraft  guns?"   The  operations 
research  team  working  on  the  problem  obtained  the  records  of 
477  attacks,  including  172  hits  and  27  sinkings.   The  con- 
clusions reached  after  analysis  was  that  a  large  ship  should 
maneuver  violently,  whereas  a  small  ship  should  change  course 
slowly.   Recommendations  were  also  made  about  how  ships  should 
turn  to  receive  inevitable  hits.   Tabulations  of  succeeding 
attacks  demonstrated  the  validity  of  the  recommendations. 
Those  ships  under  attack  which  observed  the  recommendations 
were  hit  29  per  cent  of  the  time,  whereas  others  were  hit  47 
per  cent  of  the  time. 

Nonmilitary  operations  research  is  probably  best 
exemplified  by  the  work  of  Horace  C.  Levinson,  one  of  the 
first  proponents  of  operations  research  in  the  United  States. 
He  began  his  work  in  the  1920* s  after  giving  up  a  career  in 
astronomy  for  another  in  merchandising  where  he  applied  the 
trained  scientist's  methods  to  the  problems  of  his  employer. 
His  work  included  studies  of  customers'  buying  habits,  the 
effect  of  quick  response  advertising  on  sales,  and  the  relation 
of  neighborhood  environment  to  types  of  articles  sold.   There 
were  many  others.   The  field  of  management  consulting  has  much 
in  common  with  operations  research  and  had  its  beginning  in  the 
latter  part  of  the  19th  century,  when  such  pioneers  of 
scientific  management  as  Taylor,  Gantt  and  Emerson  started 
their  work. 

The  study  of  operations  research  eventually  found  its 
way  into  the  Universities.   In  1948,  the  Massachusetts 


26 
Institute  of  Technology  established,  in  collaboration  with 
the  liavy  a  course  in  non  military  aprili cations  of  ooerations 
research.   Similar  courses  were  established  at  the  University 
College,  London  In  1949  and  Birmingham  University  in  1950. 
The  Case  Institute  of  Technology  became  the  first  institution 
of  higher  learning  to  offer  a  curriculum  in  operations  research 
leading  to  a  degree  of  Master  of  Science.   Later  Columbia  and 
John  Hopkins  followed  with  courses  in  operations  research. 

Today  operations  research  appears  to  be  In  an 
intermediate  stage  of  development.   It  has  proven  its  usefulness 
by  a  history  of  many  cases,  however,  it  is  still  developing 
with  such  speed  that  even  the  most  experienced  among  operations 
research  workers  hesitate  to  mark  out  all  the  problem  areas 
that  are  not  susceptible  to  an  operations  research  approach. 


CHAPTER  III 

STATISTICAL  MZTHODS  AND  CASE  STUDIES 

Although  most  business  executives  have  a  reasonable 
background  of  mathematical  training,  the  secrecy,  mystery  and 
terminology  with  which  the  statistician  and  technical  man 
surround  the  subject  of  statistics  are  often  as  confusing  as 

the  White  Queen's  example  of  addition. 

"Can  you  do  addition?"  the  White  Queen  asked  (of 
Alice).   'What's  one  and  one  and  one  and  one  and  one  and 
one  and  one  and  one  and  one  and  one?" 

"I  don't  know/'  said  Alice.   "I  lost  count." 
"She  can't  do  addition,"  the  Red  Queen  interrupted.  .  . 
Alice  in  Wonderland  by  Lewis  Carroll 

You  need  not  be  a  mathematician  to  understand  the  basic 

principles  used  by  statisticians  or  to  use  a  statistical 

approach  to  problems  in  comptrollership  and  business  problems 

in  general;  in  fact  addition,  subtraction,  long  division,  and 

an  occasional  square  root  thrown  in  is  about  all  that  is  needed 

for  an  executive  to  become  reasonably  familiar  with  statistical 

techniques. 

It  is  impossible  within  the  scope  of  this  paper  to 

present  in  detail  the  many  statistical  methods  used  by 

statisticians  in  obtaining  solutions  to  the  many  different 

Robert  Kirk  Mueller,  Effective  iManagement  Through 
Probability  Controls  (New  York:  Funk  and  Wagnalls,  1950), 
p.  74 ,  ff. 
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types  of  problems;  nor  is  it  possible  to  attempt  to  make 
trained  statisticians  out  of  those  who  read  this  paper,  but, 
rather  it  is  intended,  that  this  chapter  shall  be  a  non 
technical  discussion  of  some  of  the  more  important  and  useful 
statistical  techniques  used  in  decision  making,  and  the 
presentation  of  the  highlights  of  some  actual  problems  and 
the  method  used  in  their  solution. 

It  is  assumed  that  the  reader  has  had  some  training  in 
statistics  and  is  familiar  with  the  measures  of  central 
tendency  (mean,  median  and  mode),  frequency  distributions, 
measures  of  variation  (the  ran^e,  inter  quartile  range,  mean 
deviation)  and  the  most  important  of  all  measures  of  variation 
the  standard  deviation,  and  correlation  and  regression.   If 
this  is  not  the  case  I  refer  you  to  the  text  books  listed  In 
the  bibliography  and  to  Appendix  B  of  this  paper  which  contains 
a  sample  problem  using  simplified  numbers  to  demonstrate  the 
method  of  obtaining  some  of  these  various  measures.   Correla- 
tion and  regression  ie  taken  up  in  detail  in  the  latter  part 
of  this  chapter. 

Probability 

To  provide  a  proper  background  for  the  discussion  of 

statistical  method  it  is  well  to  start  with  the  theory  of 

probability  since  statistical  methods  are  based  on  this  theory. 

The  theory  of  probability  is  aptly  described  by  the  French 

mathematician  Eraile  Borel  as  follows : 

The  theory  of  probability  is  of  interest  to 
artillerymen;  it  is  of  interest,  also,  not  only  to 
card  and  dice  players,  who  are  its  godfathers,  but 
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to  all  men  of  action,  heads  of  industries  or  heads 
of  Armies,  whose  success  depends  on  decisions  which 
in  turn  depend  on  two  sorts  of  factors,  the  one 
known  or  calculable,  the  other  uncertain  and 
problematic;  it  is  of  interest  to  the  statistician, 
the  sociologist,  the  philosopher. 2 

In  lay  language  probability  may  be  defined  in  its 
subjective  sense  as  a  state  of  mind  as  to  the  degree  of 
certainty  or  uncertainty.   A  second  interpretation  is  that 
which  usually  happens  and  may  be  defined  as  the  relative  fre- 
quency with  which  an  event  occurs  in  a  certain  class  of  events. 
The  third  interpretation  of  probability  holds  that  probability 
refers  not  to  events  but  to  propositions  and  has  nothing  to  do 
with  intensity  or  belief  or  the  statistical  frequency. 

All  human  affairs  rest  ur>on  probabilities,  and  the 
same  thing  is  true  everywhere.   If  man  were  immortal, 
he  could  be  perfectly  sure  of  seeing  the  day  when 
everything  in  which  he  had  trusted  should  betray  his 
trust  and  in  short  of  coming  eventually  to  hopeless 
misery  he  would  break  doxvn  at  last  as  every  good 
fortune,  as  every  dynasty,  as  every  civilization 
does.   In  place  of  this  vt  have  death.  .  •  •' 

With  improved  accuracy  of  measurement,  it  has  been  determined 
that  no  two  things  are  exactly  alike.   In  the  case  of  a  manu- 
facturer of  uniform  items  there  exists  a  family  of  chance 
causes  of  variations,  such  sr  tool  wear,  fluctuation  in  voltage, 
variations  in  operator  performance,  etc.   V/hen  all  of  the 
controllable  factors  are  in  a  state  of  control,  these  chance 
causes  which  are  not  controlled  vary  according  to  a  definite 
symmetrical  pattern  known  as  the  normal  frequency  distribution 

2 

Ibid.,  p.  62. 

''ibid. ,  p.  67. 
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h 
curve .  figure  1. 

The  abscissa  or  the  peak  of  the  curve  represents  the 

central  tendency  or  average  of  all  measurements  with  the 

number  of  observations  or  measurements  varying  in  accord  with 

the  curve  to  the  left  and  right  of  the  center  line.   It  can  be 

noted  that  the  number  of  observations  or  measurements  which 

are  farthest  from  the  average  are  very  few  in  number,  but 

they  still  exist. 

Once  the  mathematicians  had  determined  that  a 
system  or  rrccess  undisturbed  and  subject  only  to 
chance  variations,  could  be  arrayed  in  a  frequency 
distribution  curve  this  information  was  put  to  work 
practically  by  further  developing  a  mathematical 
theory  snout  the  characteristics  of  the  joints 
"  scattered"  about  the  average  or  central  tendency 
of  the  diagram.   The  "scatter"  points  sre  independent 
of  the  average  central  tendency.   A  characteristic 
of  the  freauency  distribution  curve  v.-ill  be  seen  in 
following  the  curve  from  the  left-hand  side  upward  where 
it  is  noticed  that  the  direction  of  this  curve  changes 
from  concave  to  convex  and  passes  through  a  flat  point 
e  little  over  half  way  towards  the  peak.   Following  the 
curve  up  over  the  peak  and  back  down  it  is  seen  that 
this  tendency  is  changed  to  where  the  curve  goes  from 
convexity  to  concavity  after  pasting  through  another 
flat  point.   The  flat  point  can  be  determined  geometrically 
and  is  a  characteristic  of  each  ideal  frequency  distribu- 
tion.  The  distance  between  this  flat  point  and  the  mean 
is  the  same  on  either  side  of  the  central  tendency  line 
and  has  been  arbitrarily  called  a  "standard  deviation" 
designated  in  mathematical  terminology  by  the  Greek 
letter  slgma  (#").* 

It  has  been  determined  that  the  number  of  parts 
occurring  in  the  area  bounded  by  sigma  on  each  side  of 
the  roiddle  line  corresponds  to  68. 3^  in  an  ideal  freauency 
curve.   It  has  also  been  determined  that  the  number  of 
parts  or  measurements  which  occur  within  two  slgma  limits, 
that  is,  twice  the  standard  deviation  distance  on  either 
side  of  the  central  average,  amounts  to  95.5  percent  of 
the  occurances  or  parts.   The  three-sigma  limit  reaches 

4,lbld.  ^Ibid..  p.  69. 
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out  lr.to  the  asymptotic  zone  and  accounts  for  99.73 
percent  of  the  parts,  and  this  three-sigma  Unit  la 
often  adopted  as  an  economical  and  practical  basis  or 
limit  for  commercial  use  of  statistical  control.   These 
limits  are  such  that  an  ideal  process  operating  under 
the  influence  of  chance  variability  trill  produce  all 
production  items  within  the3e  percentage  limits.   In 
mathematical  language  this  is  referred  to  is  'plus  or 
minus  three  standard  deviations •  or  'three  slgma 
limit. *6 

In  most  cases  of  industrial  or  scientific  decision 
making  it  is  generally  accepted  that  a  probability  of  95  per 
cent  (two  sigma)  is  the  minimum  limit  for  a  good  decision.   To 
hold  out  for  99  per  cent  confidence  would  in  most  eaaei  involve 
considerable  expense.   When  working  at  the  95';:  confio'enc^ 
level,  it  is  moot  probable  that  a  correct  decision  is  made  at 
leastl9  times  out  of  20. 

Sampl  lnp; 

Everyone  who  has  considered  business  problems  knows 
that  data  on  many  situations  are  sometimes  required  under 
circumstances  where  it  lb  not  economically  feasible  or  even 
possible  to  take  measureiiientb,  enumerate,  or  ter.i  the  entire 
universe  or  population  under  consideration.   Very  often, 

however,  a  samDle  may  be  of  great  assistance,  and  more  often 
than  not  a  sample  will  provide  more  accurate  results  than  the 
complete  data.   Although  a  complete  enumeration  will  avoid 
making  sampling  errors,  overall  accuracy  can  b©  Increased  by 
conducting  a  sample  survey  rather  than  a  conr-lete  count.   For 
example  the  smaller  scope  of  the  sample  orojoct  makes  it 
possible  to  be  more  selective  in  assignment  of  personnel  for 


tIbld.t  p.  70,  ff. 
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the  survey  operation;  to  be  more  thorough  in  their  training; 
and  to  be  able  to  concentrate  to  a  much  greater  degree  on 
reduction  of  sampling  errors.   The  net  effect  of  a  sample 
survey,  as  compared  to  the  100$  count,  is  often  in  addition  to 
being  more  accurate,  is  achieved  with  fewer  personnel,  less 
paper  work,  at  a  lower  cost,  and  in  a  shorter  time. 

It  is  well  at  this  point  to  distinguish  between  a 
population  (or  universe)  and  a  sample.   A  population  is  the 
entire  set  or  group  about  which  we  desire  some  type  of  informa- 
tion, while  a  sample  is  a  subset  or  a  part  of  the  population. 
In  the  population  there  are  certain  characteristics  which  we 
may  estimate  or  which  we  may  desire  to  observe  and  control. 
These  characteristics  are  called  parameters.   The  corresponding 
characteristics  in  a  sample  are  called  statistics. 

Sampling  is  the  technique  of  drawing  a  few  items  from 
a  universe  or  mass  in  such  a  way  as  to  secure  statistics  on  the 
basis  of  which  we  may,  by  inference,  construct  a  picture  of 
the  mass  from  which  the  items  come.   The  technique  of  sampling 

is  used  because  frequently  it  is  the  only  possible,  or  practical 
and  usually  the  most  efficient  means  of  studying  mass  data. 

A  situation  in  which  sampling  is  the  only  possible  method 
would  occur  when  materials  are  tested  for  quality,  the  object 
may  be  destroyed  as  would  be  the  case  in  testing  bricks  or 
steel  parts,  for  example,  for  tensil  strength  or  for  resistence 

Q 

M.  J.  Slonim,  "Sampling  in  a  Nutshell."  Directorate 
of  Statistical  Services,  DCS/  Comptroller,  Headquarters, 
U.S.A.F.,  p.  16,  ff. 
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to  crushing  forces.   Sampling  would  be  the  only  practical 
method  for  example  when  the  statistical  universe  it  large  and 
composed  of  thousands  or  even  millions  of  items.   To  enumerate 
the  entire  population  would  be  impractical  from  a  time  and 
cost  point  of  view.   Even  though  in  some  case  it  would  be 
possible  from  a  physical  or  financial  standpoint  to  make  a 
complete  count,  as  was  pointed  out  above,  it  is  not  necessary 
to  do  so.   A  sample  that  is  properly  taken  will  provide  the 
necessary  data  from  which  generalizations  about  the  population 
can  be  made,  and  far  more  efficiently.   Briefly  let  us  examine 
a  few  of  the  sampling  techniques  that  can  be  used  for  any  of 
the  problems  the  statistically  minded  comptroller  may  encounter. 
Simple  Random  Sampling 

Simple  random  sampling  is  simply  a  sampling  technique 
in  which  chance  alone  dictates  which  elements  of  the  popula- 
tion will  be  members  of  the  sample.   All  subjectiveness  and 
human  bias  have  supposedly  been  removed.   In  a  random  sample 
each  element  of  the  universe  or  population  has  an  equal  chance 
of  being  selected.   If,  under  these  conditions  a  sufficiently 
large  number  of  items  is  collected,  the  sample  will  be  a 
miniature  cross  section  of  the  mass  from  which  the  items  come. 
Random  samples  are  not  haphazard  selections,  but  rather  the 
system  of  selection  is  controlled  and  is  not  a  hit  or  miss 
selection.   Tables  of  random  digits  have  been  prepared  to 
assist  in  obtaining  a  truly  random  sample  from  a  population. 

9 

William  Addison  Neiswanger,  Elementary  Statistical 

Methods  (New  York:   Macmillan  Company,  1951) >P»  96. 
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Stratified  Sampling 

There  are  particular  situations  when  it  is  desirable 
to  have  all  elements  of  the  population  represent.   For  example, 
when  making  a  study  of  inventory  items,  a  simple  random  sample 
may  exclude  the  more  costly  items  at  the  extreme  range  of  the 
population  from  being  selected  when  it  is  desirable  or 
necessary  to  have  some  information  about  these  costly  items. 
Stratified  sampling  is  a  scheme  for  accomplishing  this  task 
while  at  the  same  time  retaining  the  desirable  property  of  a 
random  sample. 

To  accomplish  this  the  population  is  divided  into 
homogeneous  groups,  referred  to  as  strata— each  element  in  thr 
group  is  similar  to  the  other  elements  within  the  group.  For 
example  in  the  case  of  inventory  items  we  may  establish  groups 
or  strata  of  (1)  all  items  valued  at  less  than  £5.00  (2)  items 
valued  from  |5.00  to  $25.00  (3)  items  valued  from  £25.00  to 
|99.00  and  (4)  all  items  that  exceed  *100.00.   Kow  that  the 
strata  or  groups  have  been  selected  or  established  we  sample 
randomly  from  within  each  strata. 
Cluster  Sampling 

There  are  on  occasion  situations  in  which  the  ease  of 
sampling  and  the  cost  are  important,  and  we  are  unable  to 
select  a  simple  random  or  stratified  sample  from  a  given 
population.   In  this  situation  we  may  divide  the  population 

Chester  H.  McCall,  MA  Statistics  Manual."  Unpublished 
statistics  manual  prepared  for  the  Navy  Graduate  Comptroller- 
ship  Program,  Department  of  Statistics,  The  George  Washington 
University,  Washington,  D.  C. ,  1958,  p.  34,  ff. 
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Into  clusters  which  are,  In  fact,  miniature?  populations, 
and  then  to  select  at  random,  several  clusters,  OX  a  single 
cluster,  from  the  population  of  clusters.   From  these  clusters 
we  may  then  draw  Inferences  about  the  whole  population.   In 
order  for  this  technique  to  be  successfully  applied  the 
clusters  must  be  similar  to  one  another  1b  their  characterist- 
ics.  For  example,  if  we  assume  that  all  Ces trovers  ehould 

have  crews  with  similar  attitudes  regarding  retirement  and 
fringe  benefits,  then  the  selection  of  a  single  crew,  or 
several  crews  would  suffice  to  draw  some  inference  about  the 
attitudes  of  such  crews  in  general;  however,  should  the  ship 
commander  strongly  influence  the  attitudes  of  his  ircn,  then  In 

all  probability  such  a  cluster  process  would  not  reflect  the 
over-all  set  of  attitudes.   It  can  be  seen  that  costs  and 
administrative  problems  would  be  reduced  if  cluster  sampling 
can  be  used  of  one  or  several  crews  rather  than  a  random  sample 
of  all  such  crews  in  the  Navy. 
Systematic  Sampling; 

Occasionally  the  population  from  which  a  sample  is  to 
be  taken  is  arrayed  in  a  numerical  sequence;  for  example,  a 
series  of  vouchers  or  serialized  cancelled  checks.   In  this 
case  we  can  use  the  technique  of  systematic  sampling  rather 
than  simple  random.   Let  us  suppose  for  example  that  we  have  a 
population  and  that  it  is  desired  to  select  a  random  saarcle  of 

200.   A  random  digit  may  be  selected  between  000  and  0049  to 
Indicate  the  starting  voucher  number.  From  this  point  on, 
every  50th  voucher  is  selected  in  a  systematic  manner.   The 
sample  of  200  then  constitutes  a  simple  random  sample  under 
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these  conditions .  We  must  use  caution,  however,  and  be  certain 
that  the  ordering  of  the  population  is  in  -no  -ray  dependent 
upon  the  characteristic  examined .   For  example  if  every  50th 
voucher  were  associated  with  a  Particular  comoanj,  it  would 
be  impossible  to  draw  any  general  inference  about  all  of  the 
vouchers  in  the  population. 
Sequential  Sampling 

Sequential  sampling  is  a  technique  developed  during 
World  War  II  by  Or.  Abraham  Wald  of  the  Statistical  Research 
Group  at  Columbia  University.   The  philosophy  of  this  technique 
is  that  often  decisions  can  be  made  before  the  entire  sample 
has  been  examined.   The  technique  Involved  in  this  type  of 
sampling  in  that  observations  are  selected  one  at  a  time, 
with  a  decision  being?  made  at  each  step,  based  on  the  cumulative 
Information  after  the  selection  of  each  observation.   Three 
decisions  nresent  themselves  (1)  that  ther*  is  rot  sufficient 
information  to  make  a  decision  and  an  sdditional  observation 
is  necessary;  (2)  the  data  are  inconsistent  with  the  hypothesis 
bcin?  tested  and  hence  it  must  be  rejected;  or  (3)  the  datr 
are  consistent  with  the  hypotheses  being  tested  and  hence  it 
may  be  accepted.   In  other  words  a  decidedly  defective  popu- 
lation will  be  apparent  after  the  examination  of  an  extremely 

small  pert  of  a  sample,  thus  it  becomes  only  necessary  to 
sample  until  our  hypotheses  Is  either  accepted  or  rejected* 
By  the  use  of  this  technique  it  is  possible  to  draw  a  conclusion 
with  a  sample  size  of  approximately  50*  of  that  used  for  simple 
random  sample. 
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Sampling  and  Non  Sampling  Errors 

Whenever  the  technique  of  sampling  if  used  there  is 
inherent  in  the  process  the  poesibility  of  encountering  sampl- 
ing errors  and  sampling  biases,  however,  whenever  sampling  is 
performed  so  that  every  unit  in  the  statistical  universe  has 
a  chance  of  being  selected  and  the  probability  of  selection  is 
known,  the  errors  of  sampling  can  be  controlled  satisfactorily. 
This  is  known  as  probability  sampling. 

Some  of  the  important  types  of  aampllng  errors  are 
encountered  when  conducting  a  survey.   The  average  lady  over 
forty  reports  her  age  as  under  forty,  a  bank  teller  states 
his  occupation  as  a  banker  and  significant  data  may  be 
eliminated  from  report  forms.   There  are  the  errors  of  the  non 
response  type  specifically  a  household  survay  conducted  during 
the  daytime  would  yield  a  poor  estimate  of  lady  truck  drivers, 
fcS  evening  follow  up  of  the  not  at  homes  would  be  necessary  to 
overcome  this  bias.   Errors  occur  in  sample  selection  such  as 

chunk  samples  conducted  for  convenience  at  Joe's  bar  or  the 
ball  park. 

The  result  of  a  sample  survey  will  hardly  ever  agree 

completely  with  that  of  a  complete  enumeration.   The  difference 
between  the  two  is  known  as  the  sampling  error.   If  a  probabi- 
lity sample  design  is  used,  it  is  possible  to  predetermine  the 
si7e  of  sample  needed  to  obtain  a  specified  degree  of  precision. 
Sampling  is  used  in  many  ways  throughout  the  Department 
of  Defense  the  results  providing  important  decision  making  data. 
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For  example  sample  surveys  are  conducted  of  military  personnel 
to  learn  certain  characteristics  about  them  and  to  find  out 
their  opinions  and  intentions  on  a  wide  variety  of  subjects. 
This  information  is  used  for  guidance  in  formulating  personnel 
policies  and  actions  that  have  considerable  impact  on  virtually 
all  defense  activities. 

Recently  a  sample  survey  was  conducted  of  Air  Force 
clothing  store  sales.   On  the  basis  of  ore-test  data  it  was 
determined  that  a  sample  of  35  out  of  more  than  200  stores 
world-wide  would  yield  acceptable  reliable  clothing  size 
distribution.   Sales  stores  were  stratified  by  size  of  store 
within  climatic  regions  and  samples  were  selected  from  each 
regional  listing.   The  results  of  the  survey  yielded  accurate 
size  distributions  for  principal  clothing  items;  the  distribu- 
tion of  sales  made  by  climatic  region,  of  sales  by  item,  and 
by  size  of  store.   Such  information  as  this  helps  to  achieve 
Improved  purchasing  and  stockage  procedures;  smaller  inven- 
tories, higher  inventory  turnover;  and  better  service. 

By  way  of  summary  it  can  be  stated  that  the  appropriate 
method  of  statistical  sampling  is  a  function  of  the  data 
desired,  the  form  and  availability  of  the  copulation,  the 
administrative  requirements  and  costs  associated  with  sampling. 
To  determine  the  method  of  sampling  is  the  problem  of  the 
sampler. 

Statistical  Inference 

In  this  section  we  shall  discuss  a  few  of  the  more 
important  techniques  of  statistical  inference  and  illustrate 


40 
some  of  the  various  techniques  with  actual  case  problems. 
Before  beginning  a  discussion  of  statistical  Inference  it  Is 
first  Important  that  the  term  be  understood.   By  inference  we 
mean  the  process  of  generalizing  from  a  bit  or  segment  of 
information  to  the  state  of  affairs  in  the  population  from 
which  the  segment  of  information  has  been  obtained.   It  Is  a 
process  of  inductive  reasoning  or  reasoning  from  the  specific 
to  the  general.   Stated  in  another  way  statistical  inference 
is  reasoning  from  the  statistic  and  applying  it  to  the  popula- 
tion—or generalizing  from  a  set  of  sample  results  to  the 
population.   This  is  a  phase  of  deductive  logic. 

There  are  two  basic  problems  of  statistical  inference; 
these  are  (1)  problems  of  estimation,  and  (2)  testing  a 
hypothesis.   For  example  we  may  select  a  sample  from  a  popula- 
tion for  the  purpose  of  estimating  some  parameter  or  character- 
istic in  the  population  such  as  to  estimate  the  proportion  of 
inventory  which  is  valued  at  less  than  $20,000.   This  is  known 
as  estimation.   Or  we  may  have  the  problem  of  observing 
whether  specifications  are  being  met  where  items  are  being 
manufactured  and  must  meet  a  specific  weight  or  tolerance. 
This  Is  referred  to  as  testing  a  hypothesis. 

There  are  several  methods  for  obtaining  solutions  to 
the  two  basic  problems  of  inference  and  will  be  discussed  In 
the  following  paragraphs.   The  methods  to  be  discussed  are  as 
follows:   Estimation  by  (1)  the  arithmetic  mean.  (2)  proportion . 
(3)  correlation  coefficient  and  (4)  regression.   The  techniques 
for  testing  a  hypothesis  are  (1)  the  arithmetic  mean 
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(2)  proportion  and  (3)  correlation  coefficient. 

The  following  table  indicates  the  symbols  for  the 
population  parameters  and  corresponding  sample  statistics: 

Characteristic  Parameters  Statistics 

Arithmetic  mean  /**   (Mu)         X 

Standard  deviation        <f   (slgma)       S 
Proportion  77" (pi)         p 

Correlation  Coefficient    P   (rho)         r 

Estimation 
Estimate  the  True  Proportion 

In  the  following  problem  it  is  desired  to  estimate  the 
true  proportion  of  a  population  possessing  a  particular 
characteristic.   Briefly  the  problem  was  to  conduct  an  audit 
and  estimate  the  number  of  pay  records  that  contain  monetary 
errors.    These  records  are  maintained  in  the  officers  accounts 
office  of  the  Department  of  the  Navy,  Washington,  D.  C.   For 
the  purpose  of  this  problem  a  monetary  error  is  defined  as  the 
omission  or  commission  of  an  act  which  is  either  definitely 
prescribed  or  prohibited  by  departmental  regulations.   In  this 
particular  problem  it  was  necessary  to  determine  the  correct 
sample  size;  to  determine  the  percentage  of  erroneous  pay 
records  in  the  sample;  and  to  estimate  the  number  of  erroneous 
pay  records  in  the  universe  by  statistical  inference. 

^George  D.  Tracy,  Lt.  Cdr.  USN,  MA  Pay  Record  Audit 
Problem."  Unpublished  term  paper,  Department  of  Statistics, 
George  Washington  University,  1958. 
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Because  we  are  dealing  with  pay  records,  and  pay- 
record  entries  govern  entitlement  to  money,  we  want  to  insure 
that  the  estimate  of  the  error  rate  in  the  universe  is  correct 
with  a  99$  degree  of  confidence. 

The  following  formula  is  used  in  the  solution  of  the 

problem.  0 

IT     7T  (1-  7T  )  N 


n 


(N-i)  e2  ♦  a2  jr   (i-  7T  ) 


Where : 

S  =  2.58  For  99^  confidence  limits.   Obtained  from  table  I 

appendix  A. 
7T=  .01  An  assumption  of  the  approximate  error  rate  based 

on  a  previous  URAO  AUDIT. 
E  =  .01   (Epsilon)   The  amount  of  error  that  can  be 

tolerated  in  the  statistic  "p". 
N  =  5,920  The  number  of  pay  records  in  the  universe  under 

study. 

By  substitution  of  this  data  in  the  formula  we  are  able  to 
determine  the  value  for  n  the  sample  size. 

n  =  (3t^)2  x  (,01)  x  (l-t01)  x  (5.920) 

(5,920-1)  x  (.01)2  ♦  (2.58)2  x  (.01)  x  (1-.01) 
n  ,  (6.66)   x  (.01)  x  (.99)  x  (5.920) 


n  =  ('°7)/X6-K,9l?0? 
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.66 
n  =  627 

Now  that  the  sample  size  of  627  has  been  determined 
the  next  step  is  to  prepare  an  alphabetical  listing  of  the 
names  of  the  officers,  for  whom  pay  records  are  maintained,  by 
running  a  "money  list"  using  the  addressograph  plates  kept  on 
file  for  that  purpose.  Following  the  preparation  of  the  money 
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list  a  starting  point  should  be  selected  at  random  and  this 
point  checked.   Additional  names  would  be  checked  at  an 
interval  of  ten  names  until  the  627  names  were  all  checked  off. 
The  next  step  of  course  is  to  go  to  the  pay  record  files  and 
pull  the  pay  records  of  the  627  names  that  have  been  checked. 
The  required  samnle  would  be  comprised  of  this  group  of  pay 
records. 

The  statistical  conclusion  that  can  be  made  from  this 
sampling  process  is  as  follows:   Let  us  assume  that  the  sample 
contained  12  pay  records  that  had  monetary  errors.   By  referring 
to  table  2,  appendix  A,  for  99$  confidence  limits;  where  HpM 
is  .02  and  the  sample  size  is  627  the  table  shows  that  the 
proportion  of  the  population  (  JT   )  lies  within  the  range  of 
.01  and  .035  which  is  written  .OK 77">. 035.   This  means  that 
with  99$  confidence  I  can  be  sure  that  the  percentage  of 
monetary  errors  in  the  universe  of  5,920  pay  records  lies 
between  1$  and  3.5^. 

There  are  other  approaches  to  this  type  of  problem. 
For  example  if  the  sample  size  is  large  then  the  following 
formula  yields  a  fairly  accurate  set  of  confidence  limits: 

p  -  Z  VP  d-P)/n  <  7T<  P  +  2  Vp  d-P)/n 

Let  us  consider  the  following  problem.  A  random  sample  of 
enlisted  men  indicated  that  21$  plan  to  re enlist  when  their 
present  hitch  is  up.   There  were  400  men  in  the  sample.   The 
99$  limits  are: 


3   oj- 
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.21   -  2.576  Y.21  x   .79/400    <  7T<  .21  =  2.576  V.21  x.  79/40 

.21   -.052   <  7T<  .21    ♦    .052 

.153    <  7T<  .262 

From  this  we  may  conclude  that  the  99^  confidence  limits  for 
the  true  proportion  expecting  to  reenlist  are  from  .158  to 

TO 

.262,    or  stated  another  way  from  16%  to  26%   plan  to  reenlist 
Correlation  Coefficient 

There  are  on  occasion  problems  where  two  variables  are 
analyzed  together,  and  a  correlation  coefficient  is  often 
computed.   Because  the  sample  correlation  coefficient  is 
subject  to  some  variation  it  is  often  of  Interest  to  estimate 
the  true  population  correlation  coefficient. 

There  are  precise  methods  for  calculating  the  desired 
confidence  limits,  however,  there  are  tables  that  can  be  more 
easily  used  to  obtain  the  95$  and  99$  confidence  limits  for 
the  true  correlation  coefficient.   This  of  course  is  especial!, 
useful  for  the  non  mathematician. 

For  example  a  study  was  run  to  examine  the  relationshi 
between  personnel  on  board,  and  recurrent  maintenance  and 
operation  costs  for  the  Marine  Corps  Supply  Depot,  Albany, 
Georgia.  ^  ^  sample  of  16  yielded  a  correlation  of  .67. 


12 

McCall,  op.  clt. .  p.  45. 

15S.  Frank  Leis,  Major  USMC  and  Jim  H.  Bolton, 
Captain  USMC.   "Correlation  of  Manpower  and  Maintenance  and 
Operations  Cost  of  Ten  Major  Marine  Corps  Posts  and  Stations. 
Unpublished  term  paoer,  Dept.  of  Statistics,  Oeorge  Washington 
University,  1953. 
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Consulting  table  IV  aopendix  A  the  95$  confidence  limits  are 
estimated  to  be 

.30  <  p  <  .87 

The  interpretation  of  these  results  will  be  referred 
to  later  in  our  discussion  of  correlation  and  regression 
problems,  along  with  estimation  by  regression  techniques. 
Arithmetic  Mean 

This  statistical  technique  is  used  when  it  is  desired 
to  estimate  the  mean  of  a  population,   A<  ,  using  the  mean 
of  our  sample  X.   If  the  standard  deviation  of  our  population 
is  known  the  interval  estimate  for  it  is  determined  by  the 
following  formula: 

X  -  Z  ■  f         S   v    /     x  ♦  z 


The  symbol  Z  is  the  normal  deviate  selected  from  table  I  to 
yield  the  desired  degree  of  confidence.   If  95%   confidence  is 
desired,  Z  =  1.96;  if  99^  confidence  is  needed  Z  =  2.576. 

For  example  let  us  assume  a  sample  of  flying  hours  was 
obtained  yielding  the  following  results: 

n  (number  of  observations)  =  100 
X  (mean  of  the  sample)  =1.8  hours 
the  standard  deviation  of  the  population  is  known  to  be 

=  .6  hours.   For  a  95^  confidence  level  the  interval 
estimate  for    (mu)  the  mean  of  the  population  becomes: 
1.8  -  1.96  x  .6/  VlOO  <  A<  1.8  +  1.96  x  .6/  VlOO 
1.8  -  .12  <a^<1.8  ♦  .12 
1.68  <^<  1.92 
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The  95%   confidence  limits  for  the  true  average  hours  of  flying 
time  are  1.68  to  1.92  hours. 

Many  times,  however,  f<    and  (5~  are  not  known.   In 
this  case  the  following  formula  is  used  X-t  s/  'Vn-l  <C  /C< 
X+t  s/  Vn-T.   In  this  formula  s  =  standard  deviation  of  the 
sample  which  can  easily  be  computed  and  "t"  is  obtained  from 
tables  of  selected  Students  H*  values.1 v 

Testing  a  Hypotheses 

In  practically  everything  we  do  in  our  daily  routine 
of  living  we  are  compelled  to  make  decisions.   These  decisions 
may  be  based  on  experience,  impulse  and/or  scientific  evidence, 
Although  there  is  some  diversity  of  basis  for  decision  making, 
it  is  still  possible  to  study  and  analyze  the  merits  and  dis- 
advantages of  the  various  methods  of  making  decisions  by 
evaluating  the  ricks  which  they  force  us  to  assume.  A  simple 
hypothesis  might  be  of  deciding  which  road  to  take,  the  right 
or  the  left,  when  arriving  at  a  fork  in  the  road  or  testing  a 
pair  of  dice  to  see  whether  they  are  biased.   The  technique 
of  testing  a  hypothesis  is  used  frequently  in  the  field  of 
quality  control.  Here  certain  standards  have  been  set  up  for 
production  and  periodic  samples  are  selected  to  determine 
whether  or  not  the  items  meet  specification.  A  hypothesis  is, 
therefore,  a  statement  which  it  is  desired  to  reject  with  a 
specified  degree  of  confidence,  or  to  accept  for  lack  of 
sufficient  contrary  evidence. ** 

l4McCall,  op.  clt.t  p.  43. 

^John  E.  Freund,  Modern  Elementary  Statistics  (New 
York:   Prentice-Hall,  1952),  p.  189,  ff. 
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There  are  two  types  of  errore  that  may  be  committed  in 
experimental  situations  of  testing  a  hypotheses.   These  are 
called  the  type  one  error  and  the  type  two  error.   The  type  I 
error  occurs  when  the  hypothesis  being  tested  is  true  and  we 
have  rejected  it.   Type  II  occurs  when  the  hypothesis  being 
tested  is  false  and  we  accept  it. 

Generally  in  testing  a  hypothesis  the  null  hypothesis 
i3  formed  and  is  represented  by  the  symbol  Ho  .   The  null 
hypothesis  is  a  hypothesis  which  is  formulated  for  the  express 
purpose  of  being  rejected.   It  is  much  easier  to  prove  things 
wrong  than  it  is  to  prove  things  right.   If  we  were  asked,  for 
example,  to  orove  that  all  sheep  are  white,  it  would  require 
investigation  of  every  single  sheen  to  determine  the  color. 
On  the  other  hano  it  would  take  only  one  black  sheep  to  show 
that  the  hypotheses  is  false.   Hence  it  is  much  easier  to 
calculate  the  error  which  may  be  committed  by  rejecting  a 
hypotheses  than  it  is  to  calculate  the  error  which  may  be 
committed  by  accenting  it. 
Correlation  Coefficient 

When  a  sample  correlation  coefficient  has  been  computed 

an  interpretation  is  sometimes  desired.   In  the  correlation  of 

manpower  and  maintenance  and  operations  costs,  °  mentioned 

earlier,  the  null  hypothesis,  that  there  was  no  significant 

correlation  between  the  two  variables  in  the  population  of 

personnel  on  board  and  maintenance  and  operations  costs  was 

formed. 

H0    :  (0  =  0 

16 

Leis   and  Bolton,    op.    clt. 


To  test  this  hypothesis  we  refer  to  table  IV  appendix 
A  for  p    (rho)  *  0  and  n  n   16,   The  95$  confidence  limits  for 
r  are  estimated  as  -.49  <  r  <  «k49  hence  it  is  determined  that 
if  the  calculated  correlation  coefficient  lay  outside  this 
range  we  would  have  a  significant  correlation  between  the 
variables,  with  95;£  confidence  that  a  true  hypothesis  was  not 
being  rejected.   The  data  for  the  problem  produced  an  v  =   .69 
for  the  Marine  Corps  Cupply  Depot,  Albany;  r  =  .53  for  Marine 
Corps  Supply  Forwarding  Annex  San  Francisco;  and  r  -  .29  for  a 
consolidation  of  the  ten  major  Marine  Corps  stations.   Since 
the  r  -  .29  for  the  ten  fflftj  r  stations  falls  within  the  range 
of  chance  alone  -.49  <  r  <  +  .49  the  hypothesis  that  there  was 
no  significant  correlation  was  accepted.   For  the  individual 
stations,  however,  the  correlation  coefficients  lie  outside  the 
range  of  chance  alone,  hence  the  hypothesis  ii]  these  cases  was 
rejected.   Since  there  was  no  correlation  for  the  ten  major 
stations  and  only  a  poor  correlation  for  the  others  there  was 
no  need  to  continue  the  problem  for  the  ourpose  of  estimation 
by  linear  repression  techniques.   Further  study  using  regression 
technioues  is  necessary  to  determine  the  relationship  of  the 
variables  in  the  case  of  the  other  two  stations. 

To  illuetrate  another  case  l*t  us  use  the  crobleic  of 
where  a  random  sample  of  2C0  recruits  were  given  two  forces  of 
a  test.   FornD  A  has  been  in  use  for  15  years  and  took  8  hours 
to  administer.  A  proposed  revision,  form  B,  took  only  two 
hours  to  administer.   If  the  correlation  between  these  two 
forme  was  found  to  be  .45,  i?  it  safe  to  conclude  at  95^  level 
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that  there  is  better  than  a  chance  relationship  in  the 
population?1^  Using  table  IV,  appendix  A,  for   =0  and 
n  s  200,  the  95^  limits:  for  r  are  estimated  as: 

-  .14  <   <  +  .14 
Since  the  sample  correlation  of  .45  exceeds  the  eanple  we  can 
assume  with  a  9S\-  confidence  that  the  two  fores  bear  some 
relationship  to  each  other.   Further  study  would  be  required 
to  determine  how  Broth* 
Proportion 

Certain  tyoee  of  problems  are  adapted  to  testing  the 
hypothesis  that  the  true  prorortion  (  77")  is  equal  to  some 
specific  value.   For  example  an  ordnance  depot  contained  a 
stock  pile  of  40,000  proximity  fuses  from  World  War  II.   If 
possible  it  was  desired  to  use  these  fures  lr.     -re   modern 
weapons  system  provided  they  could  produce  a  specific  degree 
of  reliability.   If  this  is  possible  it  could  provide  a  tre- 
mendous saving  in  our  defense  effort.   Since  the  fuse  was  only 
one  component  out  of  many  in  this  system  a  95$  reliability  was 
desired.   A  sample  was  taken  consisting  of  400  fuses,  360  were 
found  acceptable  at  the  5%   level,  can  the  hypo thee la  that  the 
lot  meets  requirements  be  accented?  The  mathematics  involved 
in  this  problem  is  as  follow? : 

p  a  ?60  the  amount  of  the  sample  found  acceptable 

n  =  400  the  amount  in  the  sample 

TT  -   .95  the  degree  of  confidence  desired 

p  a  36^/400  at  .90 


17McCall,  op.  cjt..  p.  52,  ff, 
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consult!     ble  3,  appendix  a  for  n  =  400  am  JT    •  .95 
the  limitr  for  p  are: 

.925  <  p  <  .97 
The  sample  proportion  is  less  than  .925  hence  we  reject  the 
hypothesis  and  conclude  with  a  battel  than     confidence  that 
the  lot  does  not  raeet  the  reliability  requirements. 
Arithmetic  aean 

A  hypothesis  may  also  be  tasted  by  examining  the 
sample  Bean  to  pee  whether  it  is  cents  in  ed.  within  tha  limits. 
If  tha  standard  deviation  of  the  population  {  <5~  )    is  known  the 
following  formula  is  used. 


a  -*T7r-  <  x</° 


+  a 


'VF 


Let's  assume  a  manufacturing  firm  plans  to  establish  controls, 
to  cording  to  government  regulations  concerning  a  specified 
diameter  of  a  production  item.   The  lean  diameter  should  be 
21cm  with  ■  .5cm.   Sample  sizes  of  16  are  chosen  periodically 
and  a  decision  is  made  regarding  the  control  process  at  each 
time.  A  sample  mean  of  21.3  h&s  been  obtained  and  the  95^ 
level  is  considered  acceptable.   Conclusions: 

1.  The  null  Hypothesis  that  the  orocese  is  in  control 
is 

H0   ryH.  =  21em 

2.  Since  the  amount  of  error  tolerated  is   =.05  the 
appropriate  value  for  Z  is  1.96,  table  I. 

3.  The  required  limits  are: 

21  -  1.96  x  ,§/  Vl6  <  X  <  21  +  1.96  x  .5/  Vl6 
21  -   .245  <  X  <  -21  +  .245 
20.755  <  X  <  21.245 
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4.   X  =  21.3  cm.  falls  outside  of  the  limits,  hence 
we  reject  our  hypothesis  with  better  than  95$  confidence  and 
examine  the  process. 

When  the  $~    is  not  known  the  following  formula  is  used 
)JL   -  t  S/  V^l  <  X  <  /<.  +  t  S/  Vn^l 

the  value  for  "tH  is  obtained  from  Students  table  for  values 
of  V. 

Correlation  and  Regression 

Correlation  analysis  is  a  statistical  process  of 
discovering  and  measuring  functional  relations  of  two  or  more 
series  of  data  in  order  to  ascertain  the  extent  to  which  the 
variables  are  correlated.   In  other  words  to  determine  to  what 
extent  they  move  together  or  oppositely.   Stated  another  way 
correlation  involves  a  study  of  the  relationships  existing 
between  two  or  more  variables,  while  regression  may  be  defined 
as  the  prediction  of  one  variable  by  means  of  one  or  more 
addition  variables. 

To  illustrate  an  approach  to  this  technique  when  only 
two  variables  are  involved  would  be  to  plot  our  observed  data 
on  graph  paper  using  an  X  and  Y  axis  for  each  of  the  variables, 
for  example  smoking  versus  lung  cancer.   If  the  observations 
were  scattered  as  seen  below  in  figure  2  we  would  assume  that 
no  correlation  exists, 


fig.  2 
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however,  in  figure  3  lung  cancer  increase  with  smoking,  not 
a  significant  correlation  however.         a 


fig.  3 
The  observations  in  figure  4  show  a  strong  positive 
correlation, 


i 


♦    e 


fig.  4 
while  figure  5  describes  a  strong  negative  correlation. 


fig.  5 
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For  linear  relationships,  correlations  which  describe 
a  straight  line  relationship  between  the  two  variables,  a 
measure  of  the  degree  of  this  correlation  has  been  developed 
by  Karl  Pearson.   This  measure  is  known  as  the  Pearson  product- 
moment  correlation  coefficient.   This  measures  only  the  linear 
relationship  between  the  variables  and  is  written: 

[n       XY  -    X   YJ 


2 

rc  = 


[n   ZX2  -    (    Z   X)2J[    nil2-    (    Z  Y)2] 

To  illustrate  the  concept  of  correlation  let  us 
consider  a  typical  problem  and  one  that  is  of  continuing 
Interest  to  the  comptroller  from  the  standpoint  of  budget 
prediction  and  cost  control.   The  particular  problem  under 
consideration  was  formulated  for  the  expressed  purpose  of 
attempting  to  find  a  method  for  predicting  the  recurrent  station 
operations  and  maintenance  costs  for  Marine  Corps  posts  and 
stations.   In  this  particular  case  the  data  did  not  produce 
significant  results  when  all  of  the  stations  were  consolidated 
indicating  this  approach  was  not  a  useful  predictive  device 
under  this  particular  set  of  circumstances,  however,  correla- 
tions were  evident  when  the  data  for  each  individual  post  or 
station  was  computed  and  analyzed  individually.   For  the 
purpose  of  illustrating  this  technique  we  shall  use  the  data 
for  the  Marine  Corps  Supply  Depot,  Albany,  Georgia.  °     The 
original  data  are  omitted  but  the  following  sums  were  obtained 
from  16  observations.  *Xn   represents  personnel  on  board  and 

Lels  and  Bolton,  op.  clt. 
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MYM   represents   recurrent  operations  and  maintenance  costs 
in  dollars, 

ZX  =  54,900 
ZX2  =  190,508,002 

ZY  «  61,959xl° 

4 
^Y2  =  24l,323,927Xl° 

2 
ZXY  =  213,754,156Xl° 

(  Z  X)2  =  3,114,010,000 

(  ZY)  m   3,838, 917, 68lxl° 

.  2 
IXZY=  3,401,594,100  x 

n  =  16 

X  =  3,431.25 

Y  =  387,243.75 

Substitution  of  these  data  in  the  formula  page  53  provides 

the  following: 

2                                             2 
I  16  (2l3,?5li,l56x10  )  .  3,iioi,5U.9,100x10  1 
y2-r J; 

[l6  (190,503,002)-3,liU,010,000][l6  (2Ul,323,927xloU).3,838,917,68lxloUJ 

r2  a  .45138877 

r  *   .671854  (this  is  the  square  root  of  r2) 

The  r2  is  called  the  coefficient  of  determination  and  is  some- 
times used  as  a  measure  of  agreement.  It  gives  the  percentage 
of  the  variation  in  one  variable  which  may  be  accounted  for  by 
the  variation  in  the  other  variable.   In  this  case  r  is  .45139. 
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This  indicates  that  the  variability  in  recurrent  operations 
and  maintenance  costs,  45.12$  may  be  accounted  for  by  the 
variability  in  personnel  on  board.   The  remaining  54.88$  is 
due  to  other  factors. 

Since  our  purpose  in  this  problem  is  to  utilize  one 
variable  to  predict  the  other  it  now  becomes  necessary  to 
extend  the  correlation  concept  into  regression.  We  shall 
consider  only  the  simplest  of  relationships  the  straight  line 
or  linear  relationship  at  this  time.   In  the  population,  the 
true  linear  relationship  is  represented  by  the  equation 

Y  =  ^  +  p    X 
where  <?C  and  @      are  constants,  ^  being  the  Y  Intercept 
and  (3   being  the  slope  of  the  regression  line.   The  regression 
line  obtained  from  the  sample,  is  Y  =  a  +  bX.   a  and  b  are  the 
sample  estimates  for  the  parameters  ^  and  @   .   To  obtain 
a  and  b  the  following  formula  is  usedi 

n  £XY  -   SX   ^Y 


b  m 


n  IX2  -   (  2"X)2 


a  =  Y  -  b  X 

Substituting  the  data  in  the  formula  we  have: 

b  =   16  (213.754.1B6*10  )  -  3.4O1.549.100Xl° 
16  (190,508,002)  -  3,114,010,000 

xlO 

b  =   i|i57.??8 


2,132,377 
b  =  54.27454 
a  =  387,243.75  -  54.27454  (3431.25) 


. 
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a  b  367,2*3.75  -  186,229.52 
a  *  201,014.23 

Y  =  a  +  bX 

Y  =  201,014.23  +   54.2745X 

On  the  basis  of  these  answers  we  can  now  predict  the  value  for 
Y,  the  recurrent  operations  and  maintenance  cost,  for  a  given 
value  of  X,  the  personnel  on  board.   If  we  plot  these  data 
on  a  graph  we  can  see  that  there  is  a  fair  positive  correla- 
tion, however,  it  is  felt  this  correlation  is  not  quite  strong 
enough  to  be  used  with  reliability  in  estimating  budget 
requirements. 

(Figure  6  is  on  following  page) 
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25  ZA    is     Z6  Z7     t$    Z9    5®    Jl  3Z  h    31  J5"  54  57  5£  5/ 
HWN0RED5 

fig.  6 
Our  next  concern  is  to  determine  the  standard  error 
of  the  estimate.  This  Is  a  measure  or  value  of  the  regression 
line.   Of  course  the  better  the  correlation  the  closer  the 
dots  will  be  clustered  about  the  regression  line.   This  measure 
is  actually  an  estimate  of  the  standard  deviation  of  the  ob- 
served points  about  the  regression  line.   The  sample  formula 


-O^ 


+mtm+ 


58 
for  this  measure  is  as  folio 


Sy.x  =V  ii-^JTL-LJL  where    Jy2  m    Zy2  _    (    Zy)2 
n  -  g  a 

\/     (1  -.145)  86,973.25Xl°  2  ^qU  ^qU 

Sy.xsv  jy^  =  2^,323,927^  -3,838,i4l7,68l3CLU 

Ik 


16 
4 

2    a,    ^-,       o^XlO 


Sy.x  m\j    34,168,062.50      ly2  =  86, 973,. 25 
Sy.x  =  5845.34 

This  standard  error  is  relatively  small,  being  roughly 
2%.      This  result  cannot  be  attributed  to  a  strong  correlation, 
but  mainly  to  the  relatively  small  ran^e  of  values  of  Y.   The 
range  of  values  is  approximately  33$ • 

Conclusion 

In  conclusion  of  this  section  it  is  well  to  point  out 
that  the  statistical  technique  were  only  a  few  of  the  many 
different  applications  of  mathematical  techniques  that  can  be 
applied  to  meet  the  particular  needs  of  the  situation.   The 
point  that  this  author  would  like  for  the  reader  to  carry  away 
is  that  the  mathematics  involved  is  merely  the  substitution  of 
numbers  for  letters  and  simple  arithmetic.   It  is  hoped  that 
the  examples  presented  to  illustrate  the  statistical  methods 
illustrate  the  variety  of  control  type  problems  that  can  be 
solved  by  the  techniques.   In  addition  to  the  problems  already 
presented,  correlation  and  regression  has  been  used  to  compare 
direct  labor  hours  to  product  overhead  in  dollars  for  naval 
ship  yards.   Maintenance  and  operations  costs  have  been  studied 
with  aircraft  on  board  for  air  stations  producing  some 
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Interesting  values  for  r.   The  sampling  techniques  discussed 
in  the  first  part  of  this  section  have  been  used  to  develop  a 
work  sampling  method  for  naval  shipyards.   Correlations  have 
been  determined  for  such  additional  problems  as  a  comparison 
of  sick  leave  token  by  various  categories  of  civil  service 
personnel,  and  a  comparison  of  N  and  C  costs  to  plant  property. 
In  the  field  of  work  measurement  these  statistical  techniques 
are  the  major  tools. 


CHAPTER  IV 

LINEAR  PROGRAMING 

Within  the  everyday  activities  of  executives  and  the 
constant  requirement  for  their  decisions  there  exists  mathe- 
matical tools,  other  than  those  just  discussed  in  chapter  III, 
which  he  may  use  to  assist  him.   In  recent  years  mathematicians 
have  worked  out  a  number  of  new  procedures  which  make  it 
possible  for  management  to  solve  a  wide  variety  of  important 
problems  much  faster,  more  easily,  and  more  accurately  than 
ever  before.   These  procedures  have  been  called  "linear 
programing"  and  sometimes  "mathematical  programing." 

Mathematical  programing  is  not  just  an  improved 
way  of  getting  certain  Jobs  done.   It  is  in  every 
sense  a  new  way.   It  is  new  in  the  sense  that  double 
entry  booking  was  new  in  the  Middle  Ages,  or  that 
mechanization  in  the  office  was  new  earlier  in  this 
century,  or  that  automation  in  the  plant  is  new  today. 
Because  mathematical  programing  is  so  new,  the  gap 
between  the  scientist  and  the  businessman — between 
the  researcher  and  the  user—has  not  yet  been  bridged. 
Mathematical  programing  has  made  the  news,  but  few 
businessmen  really  understand  how  it  can  be  of  use  in 
their  own  companies. 

Within  the  planning  and  programing  activities  of  the 

Armed  Services  there  are  many  problems  which  are  concerned 

with  the  most  efficient  use  of  men  and  materiel.   The  solutions 


1 

Alexander  Henderson  and  Robert  Gchlaifer,   Mathematic- 
al Programing,"  Harvard  Business  Review  (May  June,  1954),  P»73. 
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to  these  problems  contribute  much  to  successful  operation 
and  decision  making  within  the  various  branches  of  the  Armed 
Services. 

The  development  end  application  of  methods  for 
determining  solutions  to  such  programming  problems  is  a  matter 
of  continuing  research.   With  the  cooperation  of  mathematic- 
ians, economists  and  other  scientists,  both  in  and  out  of  the 
Armed  Services,  mathematical  procedures  have  been  devised  that 
solve  an  important  class  of  programing  problems. 

Basic  Principles 

Problems  in  the  nlanning  and  programing  area  are 
characterized  by  limitations  and  constraints  imposed  upon  the 
availabilities  of  the  items  under  consideration.   For  example, 
the  number  of  men  and  dollars  available  to  operate  a  program 
may  be  limited. 

These  problems  are  capable  of  producing  many  possible 
solutions.   That  is  the  constraints  of  such  problems  can  be 
satisfied  in  many  different  ways.   The  selection  of  the  most 
acceptable  or  desirable  solution  ordinarily  calls  for  judgment 
and  experience. 

What  the  mathematical  methods  of  linear  programing  do 
is  to  reduce  the  whole  procedure  to  a  simple,  definite  routine. 
There  is  a  rule  for  finding  a  program  to  start  with;  there  is 
a  rule  for  finding  the  successive  changes  that  will  increase 
the  profits  or  lower  the  costs,  and  there  is  a  rule  for 
following  through  all  the  repercussions  of  each  change.   What 
is  more,  it  is  absolutely  certain  that  if  these  rules  are 
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followed,  they  will  lead  to  the  best  possible  program;  and 
it  will  be  perfectly  clear  when  the  best  possible  program  has 
been  found.   It  Is  because  the  procedure  follows  definite 

rules  that  it  can  be  taught  to  clerical  personnel  or  handed 

2 

over  to  automatic  computers.    In  other  words,  by  a  systematic 

procedure,  a  single  efficient  solution  is  chosen  that  meets 
the  conditions  of  the  problem.  A  solution  is  efficient  if  it 
accomplishes  a  specified  objective  of  the  problem.   The 
efficient  solution  may,  according  to  the  problem,  minimize 
dollar  cost  or  expenditure  of  a  critical  item,  or  maximize 
some  measure  of  combat  effectiveness:.   Such  a  solution  that 
can  solve  both  the  constraints  of  the  problem  and  the  given 
objective  is  an  optimum  solution.. 

The  computational  methods  of  linear  programing  are 
ry  general  and  are  applicable  to  a  large  variety  of  problems 
such  as  the  following: 

1.  The  most  efficient  routing  (in  termB  of  transporta- 
tion cost,  or  time,  or  requirements  for  use  of  ships  or 
aircraft)  of  material  from  a  group  of  supply  or  production 
points  to  a  group  of  storage  or  user  points. 

2.  The  efficient  and  economical  use  of  scarce  raw 
materials,  or  machines,  to  produce  the  maximum  output  of 
critical  goods. 

3.  The  best  assignment  of  output  production  of  military 
items,  such  as  planes |  between  competing  demands  of  combat 
and  training     Ions. 


2Ibld.,  p.  75. 
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4.   The  minimum  cost  of  contract  awards  to  meet  a 
given  supply  requirement  from  among  a  group  of  bidders  quoting 
various  oriees,  maxima  and  minima  production  for  various 
items,  and  other  ouantity  or  price  restrictions  as  to  bids. 

Basic  to  all  of  the  problems  is  the  matter  of  limited 
recourses  which  must  be  shared  among  a  number  of  competing 
demands,  and  as  can  be  seen  from  the  above  problems,  these 
limited  recourses  may  be  manpower,  or  capital,  or  material, 
or  equipment,  or  possibly  a  combination  of  all  of  these. 

Mathematical  programing,  of  which  linear  programing 
is  but  one  phase,  has  been  termed  a  restatement  of  the  central 
formal  problem  of  economics — that  of  allocating  scarce 
resources  so  as  to  maximize  the  attainment  of  some  predetermined 
objective- -in  a  form  which  is  designed  to  be  useful  in  making 
practical  decisions  in  business  and  economic  affairs.   The 
accent  here  is  on  the  word  "practical."  The  theory  and 
principles  that  have  led  to  the  development  of  the  mathematic- 
al programing  techniques  coming  increasingly  into  business  and 
military  use  may  veil  be  far  beyond  the  comprehension  of  the 
non-mathematical  layman,  but  there  is  nothing  theoretical  about 
their  applications;  a  glance  at  the  four  typical  problems 
listed  above  shoula  dispel  any  doubt  on  that  score.   It  is  no 
longer  premature  to  say  that  mathematical  or  linear  programing 
has  proved  its  worth  .  .  .  for  finding  optimum  economic 
programs.  ...   It  is  a  practical  tool  for  the  comptroller  and 
all  phases  of  business  planning.-5 

^R.  Dorfman,  "Mathematical,  or  'Linear'  Programming," 
American  Economic  Review.  XLIII  (December,  1953),  P#  797. 
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Problems 

A  simple  example  of  an  application  of  linear  programing 
is  probably  the  best  manner  in  which  to  explain  and  demonstrate 
Just  what  it  can  do.   Of  the  many  types  of  problems  that  lend 
themselves  to  the  methods  of  linear  programing  for  solution 
are  a  group  termed  transportation  problems.   These  problems 
call  for  the  shipping  of  a  supply  of  an  item  from  a  group  of 
depots  to  a  group  of  receiving  stations.   Each  depot  has  a 
limited  amount  of  the  item,  while  each  station  has  a  specific 
requirement  for  the  item.   Usually  there  are  many  routings 
which  will  supply  each  station  with  exactly  the  required  amount 
of  the  item.   The  problem  is  to  determine  the  routing  which 
not  only  fulfills  the  requirements  but  also  minimizes  some 
measure  of  the  program  cost.   For  example,  the  objective  might 
be  to  minimize  one  of  the  following:   the  total  dollar  cost, 
the  total  number  of  miles  of  the  shipping  schedule,  or  the 
total  time  the  items  are  in  transit. 

A  fictitious  example  of  a  routing  problem  follows. 

The  objective  in  this  case  is  to  minimize  the  total  ton-miles. 
Let  us  assume  that  Lockbourne  AFB  at  Columbus,  Ohio  has  been 
testing  a  large  item  of  equipment,  weighing  a  ton,  for  the 
B-47.   It  is  now  desired  that  this  equipment  be  tried  at  other 
bases.   Five  each  are  required  by  March  AFB  at  Riverside, 
California;  Davis-Monthan  AFB  at  Tucson,  Arizona;  and  McConnell 

4 

The  Application  of  Linear  Programming  Techniques  to 

Air  Force  Problems:  A  Non- technical  Discussion  (Washington, 
D.  C:  Directorate  of  Management  Analysis,  DCS/Comptroller, 
Hdqts.,  USAF,  17  December  1954),  p.  10. 
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AFB  at  Wichita,  Kansas.   Pinecastle  AFB  at  Orlando,  Florida, 
and  MacDill  AFB  at  Tampa,  Florida,  each  need  three.   To  supply 
these  21  items  Lockbourne  AFB  at  Columbus,  Ohio  can  ship  8; 
Oklahoma  City  Depot  has  8,  and  Warner-Robins  AFB  at  Macon, 
Georgia  has  5»      The  items  of  equipment  are  to  be  air  lifted 
to  their  destinations.   The  problem  is  to  determine  the  routing 
which  fulfills  the  requirements  and  minimizes  the  total  ton 
miles.  All  the  foregoing  information  together  with  the  appro- 
ximate  air  distances  is  summarized  in  the  following  table: 


MacDill 

rch 

Davis 
Monthan 

Mc Conn ell 

Pinecastls 

Items  Re- 
Avail-  quired 
able   Items 

3 

5 

5 

5 

3 

Oklahoma 

City        8 

938 

1030 

824 

136 

995 

Macon       5 

346 

1818 

1416 

806 

296 

Columbus     8 

905 

1795 

1590 

716 

854 

Distance  in  Miles 


A  first  attempt  at  a  solution  might  go  like  this:   Try 
to  fill  each  requirement  from  the  nearest  source.   For  example, 
let  McConnell  get  its  5  items  and  Davis-Monthan  3  of  its  5 
items  from  Oklahoma  City.   The  requirements  for  the  Florida 
bases  should  be  met,  as  far  as  possible,  from  Macon.   Let 
Pinecastle  get  its  3  and  MacDill  2  of  its  3  items  from  Macon. 
The  rest  of  the  requirements  must  be  met  from  Columbus.  This 
shipping  schedule  has  a  total  of  17,792  ton-miles  and  is 

5 Ibid.,  p.  11,  ff. 
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illustrated  in  figure  7,  page  67. 

There  are  many  other  possible  routings.  The  minimum 
solution,  obtained  by  the  methods  of  Linear  Programing,  has 
a  value  of  16,864  ton-miles.  The  computational  procedure  starts 
with  any  solution  to  the  problem — such  as  the  one  desired 
above — and  in  a  systematic  manner  obtains  better  solutions 
until  the  optimum  one  has  been  determined.  This  Linear  Program- 
ing procedure  guarantees  that  the  optimum  will  be  found.  For 
this  simple  example,  it  is  not  difficult,  given  the  number  of 
ton-miles  in  the  optimum  solution,  to  discover  the  corresponding 
routing.  Without  this  information,  one  would  not,  in  general, 
be  able  to  determine  Just  which  solution  is  optimum.  This  is 
especially  true  for  larger  problems  where  solutions  by  hand 
becomes  impractical. 

If  in  attempting  to  take  advantage  of  the  shortest 
distance  appearing  in  the  problem  by  trying  to  fill  all  of 
McConnell's  needs  from  Oklahoma  City — the  resulting  solution 
would  be  less  efficient.   This  fact  serves  to  emphasize  that 
what  at  first  appears  to  be  a  logical  approach  to  problems  of 
this  nature  is  not  necessarily  the  best  one. 

Another  simple  example  of  a  transportation  type 
problem  is  as  follows:  Suppose  that  a  regional  freight  car 
distributor  must  send  empties  from  three  divisions  on  which 
car  surpluses  are  predicted,  to  five  other  divisions  exactly 
equal  to  the  total  surpluses,  though  this  is  not  a  limitation 
for  other  than  simplifying  the  example.   Conditions  can  best 
be  expressed  and  summarized  in  the  following  table: 
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Origin         Destination Surpluses 

Dl     D2     D3  D4     D5 

51  9 

52  4 

S3  a 

Shortages       3      5      4      6      3      21 

It  should  he  noted  at  this  point  that  the  problems 
given  have  necessarily  been  extremely  simple;  so  simple,  in 
fact,  as  to  be  susceptible  of  solution  by  trial  and  error  if 
not  by  inspection.  But  if  the  examples  were  too  extensive, 
they  would  fall  as  instructive  and  readily  comprehendable 
examples.   The  table  merely  states  that  Division  SI  has  a 
surplus  of  9  cars;  D2  a  shortage  of  5  cars,  etc.   The  cost  of 
shipping  an  empty  car  from  a  shipping  point  to  a  destination 
is  tabled  as  follows: 
Origin  Destination 

D2     D3     DA     D5 
20      5      9     10 
10      8     30      6 
20      7     10      4 
The  table  indicates  that  it  costs  $10  to  send  an  empty  car 
from  Division  SI  to  Dl;  C30  to  send  a  car  from  S2  to  D4,  etc. 
Now,  while  it  is  possible  to  obtain  a  feasible  solution  by 
inspection,  it  is  not  possible  to  know  whether  the  solution 
selected  is  the  least  cost  solution,  or  what  greater  cost,  if 
any,  the  solution  has  than  the  least  cost.   Linear  Programing 
offers  a  method  not  only  of  obtaining  the  least  cost  solution, 
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but  also  of  ensuring  that  It  is  in  fact  the  least  cost. 
Appendix  "c"  gives  the  solution  to  this  problem.   The  minimum 
cost  is  $150;  one  might  find  it  interesting  to  make  up  a 


feasible  solution  and  cost  to  see  how  it  comperes  with  the 
|150  figure. 

When  one  realizes  that  the  same  technique  is  capable 
of  solving  problems  involving  a  table  300  x  500  as  easily  as 
the  3x5  table  shown  in  the  example,  the  advantages  are 
obvious.   The  time  required  is  of  course  greater,  but  the 
technique  is  identical,  and  with  the  aid  of  electronic 
computers  the  time  required  is  a  matter  of  hours  or  minutes. 

Military  Use 

Within  the  Armed  Services  it  is  often  desired  to  make 
an  efficient  deployment  of  combat  aircraft  to  competing 
activities  or  missions  given  a  production  schedule.   Those 
high  priority  missions  of  course  have  first  claim  on  the  avail- 
able aircraft.   It  is  necessary  that  we  orovide  the  maximum 
amount  of  aircraft  for  combat,  however,  we  want  to  simultaneous* 
ly  allocate  enough  aircraft  to  advanced  flying  schools  to  train 
crews  to  fly  them.  Another  objective  is  to  minimize  the 
surplus  of  crews.  Results  to  be  computed  on  a  world  wide 
basis  and  then  by  area.   These  examples  are  typical  of  the 
type  of  problem  that  faces  the  military  executive  that  can  be 
solved  by  linear  programing  procedures. 

These  problems  of  course  are  mainly  of  an  operational 

"Operational  Research  in  Distributing  Empty 
Cars,"  Railway  Age.  CXXXIV  (April  20,  1953),  pp.  73-4. 


. 


' 


70 
nature,  however,  In  the  end  analysis  cost  is  involved  and  in 
this  the  comptroller  becomes  vitally  interested.   More  speci- 
fically the  comptroller  would  be  interested  in  problems  con- 
cerning contract  awards.   For  example:   Whenever  it  is  desired 
to  procure  items  from  civilian  sources,  producers  of  the  items 
murt  be  invited  to  participate  in  the  bidding  for  contracts. 
The  individual  manufacturer  submits  bids  in  which  he  states: 

1.  The  orice  per  unit  of  article  or  articles. 

2.  The  maximum  (and  minimum,  if  any)  quantity  of 
each  item  that  can  be  produced  at  the  stated  price,  and 

3.  Any  other  conditions  he  wishes  to  impose.   The 
bid  reflects  the  manufacturer's  desire  for  profit,  his  guess 
about  the  other  fellow's  bid,  and  his  own  peculiar  limitations. 

The  comptroller  ultimately  having  to  pay  for  the 
procurement  of  specific  quantities  of  related  articles,  must 
see  that  contracts  are  awarded  in  such  a  way  that  the  total 
dollar  cost  to  the  government  is  at  a  minimum. 

In  evaluating  bids  the  comptroller's  shop  must  add 
shi*Dping  and  other  related  costs  to  each  of  the  bidders'  quoted 
prices.   Similarly,  any  savings  that  could  be  effected  by 
agreeing  to  certain  conditions  are  subtracted,  e.g.  a  discount 
may  be  allowed  for  payments  made  within  a  certain  time.   Once 
the  contracts  have  been  awarded,  the  procurement  office  must 
be  ready  to  demonstrate — even  to  the  satisfaction  of  the  losing 
bidders— that  the  total  cost  to  the  activity  of  all  contracts 
was  the  least  possible. 
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The  Bureau  of  Shins,  Department  of  the  Navy,  has 
applied  Linear  Programing  to  yet  another  problem,  most  closely 
related  to  the  economics  nroblem  of  the  demand  too  great  and 
the  resources  too  few.   Electronics  equipments  must  be  allocat- 
ed to  fill  all  authorized  allowance  requirements  aboard  the 
ships  and  on  shore  stations  of  the  Naval  Establishment. 
Equipments  are  seldom  available  in  sufficient  quantities  to 
meet  all  requirements.   In  such  cases,  priority  of  ship  types 
and  projects  is  one  of  the  controlling  factors.   The  use  of 
new  programing  techniques  will  give  an  optimum  use  factor  to 
the  equipments  available,  and  hence  to  the  ships  they  go  on, 
than  has  formerly  been  possible. 

Limitations  and  Other  Methods 

Limitations  of  the  technique  of  applying  Linear 
Programing  are  two-fold  in  nature:   mathematical  and  practical. 
The  practical  difficulties  are  those  commonly  associated  with 
applying  any  pure  science  to  business  and  financial  problems. 
One  finds  it  difficult  to  express  in  suitable  measurable  terms 
the  objectives  and  controlling  factors,  or  constraints; 
mathematical  programing  has  no  monopoly  on  this  difficulty,  and 
it  is  of  course  a  difficulty  associated  with  the  particular 
operation  under  investigation  and  study,  not  with  the  technique 
being  employed  to  investigate  it.   Secondly,  even  when  one  is 
able  to  find  suitable  and  measurable  terms  for  the  operation, 
it  may  well  be  exceedingly  difficult  to  determine  proper 
numerical  values  for  the  coefficients  of  the  terms.   Like  any 
other  problem,  if  a  more  precise  answer  is  needed,  more  precise 
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data  must  be  used,  and  that  in  turn  entails  a  thorough  and 
detailed  etudy  of  the  oroblem.   Finally,  there  is  the  computa- 
tional labor  required  to  solve  the  problem,  once  it  has  been 
set  up  in  measurable  terms  with  proper  coefficients.  The 
very  fact  that  mathematical  programing  techniques  are  being 
used  to  investigate  a  given  operation  is  indication  in  itself 
that  the  problem  is  quite  complex,  with  many  variables  and 
constraints,  such  that  it  does  not  lend  itself  to  quick 
solution  by  other  means,  or  by  inspection.   We  must  necessari- 
ly expect  then  in  cases  of  application  of  the  technique  that 
computational  labor  may  be  immense,  even  for  electronic 
clauclators;  must  accept  that  fact  as  the  nature  of  the  beast; 
and  be  thankful  that  there  is  a  technique  to  help  us  where 
until  recently  there  was  nothing  but  educated  guessing. 
"Thus  the  practical  difficulties  encountered  in  applying 
linear  programing  are  in  a  sense  an  indication  of  the  level 
of  difficulty  of  the  problems  which  linear  programing  can 
treat."7 

There  are  many  methods  used  in  solving  problems  by 
linear  or  mathematical  programing  methods.  Some  tyoes  of 
problems  lend  themselves  to  specific  forms  of  solution  better 
than  they  do  to  others.  At  the  present  time  theoretical  work 
in  methods  of  linear  programing  is  ahead  of  the  application  of 
these  techniques,  in  other  words,  more  is  presently  known 
about  tile  subject  than  is  put  into  actual  practice. 

7 

Joseph  0.  Harrison,  Jr.,   Linear  Programming  and 
Operations  Research,"  Operations  Research  for  management,  ed. 
Joseph  F.  r-cCloskey  and  Florence  N.  Trefethan  (Baltimore: 
The  Johns  Hopkins  Press,  1954),  p.  237. 
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It  is  impossible  within  the  scope  of  this  r>aper  to 
discuse  in  detail  all  of  the  presently  existing  methods.   The 
following  are  a  few  of  the  procedures  used  and  a  selected 
bibliography  for  reference: 

1.  Transportation  Procedure^ 

2.  The  Simplex  Method9 

3.  Prof it- Preference  Procedure 

4.  The  Solution  by  a  System  of  Ordinary  Differential 
Equations 

5.  The  Dual  Method12 

Perhaps  the  best  tinown  method  is  the  Simplex  Method. 
This  method  requires  that  a  mathematical  solution  is  feasible 
at  the  start,  however,  it  need  not  necessarily  be  a  practical 
solution  from  a  business  point  of  view.   A  sequence  of  feasible 
solutions  are  computed;  each  one  is  computed  from  its 
predecessor  in  a  manner  such  that  the  best  answer  is  gradually 

^Henderson  and  Schlaifer,  p.  95. 

9Ibld.«  p.  100. 

10 

A.  Charnes,  ¥.  W.  Cooper,  D.  Farr,  "Linear  Programming 

and  Profit-Preference  Scheduling  for  a  Manufacturing  Firm," 
Journal  of  the  Operations  Research  Society  of  Ameri c a ,  I 
May,  1953),  VV-    114-29. 

11 

Q-.  >.r.  Brown  and  J.  Von  Neuman,   solutions  of  Games 

by  Differential  Equations,"  Contributions  to  the  Theory  of 
Games  (Princeton,  N.  J.:  Princeton  University  Press,  1950) 
ed.  by  Harry  '*.  Kuhn  and  A.  tf«  Tucker,  pp.  73-9. 

12 

C.  F.  Lemke.  HThe  Dual  Method  of  Solving  the  Linear 
Programming  Probler, '  ;.ayal  Research  Logistics  Quarterly. 
I  (March,  1954),  pn.  36-^7. 
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approached  and  that  a  previous  solution  is  never  repeated. 

There  are  a  finite  number  of  solutions  between  the  range  of 
the  original  one  and  the  optimum.   These  operations  are  carried 
out  by  simple  repetitive  and  tedious  arithmetic  steps.   It  is 
possible  in  the  case  of  simple  problems  to  use  the  hand  method 
or  desk  calculators,  however,  for  large  problems  electronic 
computers  would  be  required. 

The  easiest  procedure  and  the  one  I  prefer  is  the 
Transportation  Problem  Procedure.   The  name  of  this  procedure 
is  somewhat  misleading  and  has  the  connotation  that  this 
procedure  can  only  be  used  for  transportation  problems, 
however,  there  are  a  multitude  of  problems  that  can  be  solved 
by  the  procedure.   It  acquired  this  misleading  term  by  being 
first  applied  to  determine  a  lowest  cost  shipping  program. 
Appendix  "C"  contains  instructions  for  its  use  and  provides 
step  by  step  instructions  for  solving  a  problem  concerning 
empty  freight  cars. 

The  other  methods  listed  are  usually  applied  to 
specific  types  of  problems  for  which  they  are  better  suited 
than  the  simplex  method.   By  and  large,  however,  experience 
has  shown  that  the  Simplex  method  is  the  most  probable  method 
to  use. 

Nummary 

In  summary  we  can  state  briefly  that  Linear  Programing 

is  a  mathematical  method  which  can  be  applied  to  a  variety  of 
problems  both  civilian  and  military — and  of  particular  use  to 
the  comptroller  in  those  areas  of  cost;  and  the  techniques  of 
Linear  Programing  can  be  used  to  find  efficient  solutions  to 
these  problems. 


CHAPTER  V 

OPERATIONS  RESEARCH 

There  is  a  new  concept  in  management  that  recently 
has  become  of  vital  Interest  to  the  comptroller.   It  Is  called 
operations  research.   The  basic  characteristic  of  operations 
research  is  the  fact  that  the  majority  of  the  practitioners 
were  trained  in  the  basic  sciences  rather  than  in  engineering 
or  administration.   Some  other  important  characteristics  are: 
that  it  is  concerned  with  research  on  the  operations  of  the 
whole  organization;  the  optimization  of  operations  in  a  manner 
that  brings  about  greater  assurance  of  both  short  and  long 
range  health  for  the  organization;  the  newest  scientific 
methods  and  techniques  are  applied  to  the  proble;  synthesis 
and  extension  of  the  methods  and  techniques  of  the  older 
management  sciences,  analytical  models  are  developed  and  used 
in  a  manner  common  to  the  basic  sciences;  experimental  opera- 
tions are  designed  and  used  in  a  manner  that  give  an  insight 
into  the  behavior  of  actual  operations;  and  the  use  of  inte- 
grated and  creative  multi-disciplinary  team  research  to  solve 
complex  operational  problems. 


Joseph  F.  VioCloskey  and  Florence  N.  Trefethen  (ed.), 
Operations  Research  for  Management  (Baltimore:  Johns  Hopkins 
Press, 1954) i  p.  xiv. 
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Operations  P.eeearch  Defined 

By  definition  Operations  Research  is: 

The  prediction  and  comparison  of  the  values, 
effectivenesses,  and  costs  of  a  set  of  proposed 
alternative  courses  of  action  involving  man-machine 
systems.   To  do  this,  it  uses  a  model  of  the  action 
that  has  been  developed  analytically  by  a  logical 
and  when  feasible,  a  mathematical  methodology.   The 
values  of  the  basic  action  parameters  are  derived 
from  historical  analysis  of  past  actions  or  from 
designed  operational  experiments.   Most  importantly, 
because  all  human  and  machine  factors  are  meant  to 
be  included,  an  estimate  of  the  uncertainty  in  the 
predicted  outcome  and  in  the  values,  effectivenesses, 
and  costs  of  the  proposed  action  is  provided.^ 

In  defining  and  attempting  to  understand  operations 
research  it  is  often  confused  with  statistics,  especially  as 
applied  to  the  body  of  specific  techniques  based  upon 
probability  theory.   The  operations  research  analyst  does, 
however,  use  statistical  methods  when  applicable  to  finding 
the  solution  of  the  particular  problem  at  hand.   Because  of 
the  wide  spread  use  of  statistical  technique  and  method  used 
in  the  solution  of  many  problems  by  the  operational  research 
analyst,  I  felt  that  a  discussion  of  this  technique  was  appro- 
priate in  this  reaper  dealing  primarily  with  statistical  method. 
Although  the  operational  research  analyst  does  use  statistical 
techniques  often  he  is  not  restricted  to  them.   Statistics  is 
concerned  primarily  with  the  relations  between  numbers,  while 
operations  research  is  concerned  with  reaching  an  understanding 
of  the  operation- -of  the  underlying  physical  system  which  the 
numbers  represent.  This  could  cause  a  significant  difference 
in  the  results  ae  well  as  the  approach.   For  example  in  a 


2 
Ibid. ,  p.  xxiii, 
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recent  advertising  study,  the  operations  research  team  found 
the  key  to  characterizing  the  way  in  which  advertising  affected 
consumers  in  the  results  of  a  series  of  Msnl it-run"  tests; 
earlier  results  had  been  presumed  useless  after  statistical 
methods,  such  as  analysis  of  variance  and  multiple  regression, 
had  failed  to  show  meaningful  conclusions. 

Operations  research  is  also  some times  confused  with 
accounting.   This  is  particularly  true  when  the  control  aspects 
of  accounting  are  being  considered,  however,  it  should  be 
noted  that  accounting  is  one  of  the  principal  sources  of  data 
to  support  an  operations  research  study.   Within  the  analysis 
and  construction  of  measures  of  control  and  the  functions  of 
operations  research  there  is  some  overlap  as  well  as  a  mutual 
interest  on  the  part  of  the  accountant  and  the  operations 
research  analysts.   It  has  been  said  in  comparing  the  two  fields 
that  the  accountant  has  served  a  useful  puroose  in  bringing 
the  importance  of  control  measures  to  the  attention  of  business 
management,  while  operations  research  has  shovm  ability  in 
building  new  methods  for  developing  and  implementing  these 
concepts  of  control.  Other  services  such  as  marketing  research, 
engineering,  and  the  industrial  engineering  field  are  sometimes 
confused  with  operations  research.   Perhaps  the  most  significant 
difference  between  operations  research  and  these  other  services 
lies  in  the  type  of  people  employed.   Operations  research 
people  are  scientists,  not  experts.  Their  value  is  not  in 
their  knowledge  or  business  experience  but  rather  in  their 
attitude  and  methodology.   It  should  be  noted  also  at  this  point 
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that  operations  research  is  not  in  competition  with  the  other 
service?,  but  rather  compliments  the  other  services  and 
frequently  serves  to  integrate  other  Information,  and  uses 
the  expert  opinion  and  factual  data  provided  by  other  services 
in  an  organized  and  systematic  analysis.   A  well  organized 
operations  research  group  should  have  available  the  services 
and  counsel  of  experts  in  th«  other  fiells  for  the  most 
effective  joint  attack  on  management  problems. 

Types  of  Problems 
In  chapter  II  of  this  paper  it  was  noted  that  operations 
research  had  its  beginning  primarily  in  solving  military 
problems  of  a  tactical  and  strategic  nature.   It  is  well  at 
this  ooint  to  list  some  areas  of  application  other  than  those 
of  a  strictly  military  nature.   Today  operations  research  is 
used  to  solve  such  diverse  business  problems  as  directing 
salesmen  to  the  right  accounts  at  the  right  time,  dividing 
the  advertising  budget  in  the  most  effective  way,  establishing 
equitable  bonus  systems,  improving  inventory  and  reordering 
policies,  planning  minimum-cost  production  schedules,  and 
estimating  the  amount  of  clerical  heln  needed  for  a  new 
operation  in  order  to  accomplish  these  tasks.   Operations 
research  analysis  uses  the  mental  processes  and  the  methodolo- 
gies which  we  have  come  to  associate  with  the  research  work  of 
the  ohysicist,  the  chemist,  and  the  biologist — what  has  come 
to  be  called  "the  scientific  method," 

^Cyr-il  C.  Herrmann  and  John  F.  Magee,  "Operations 
Research  for  Management,*1  Harvard  Business  Review  (July- 
August,  1953),  n.^103. 


79 

The . Scientific  a Method 

The  basic  premise  under-lying  the  scientific  method 
is  the  belief  that  phenomena  have  a  cause,  and  it  is  the 
scientist's  contention  that  by  hard  work  the  mechanism  or 
system  underlying  the  observed  facts  can  be  discovered  and 
used  to  the  investigator's  own  best  advantage. 

The  scientif5c  method  calls  for  a  special  mental 
attitude,  thnt  la  a  reverence  for  facts,  facts  that  not  only 
explain  the  result  but  those  that  exnlain  the  causes  also. 
';hen  the  executive  looks  at  sales  figures,  he  sees  them  in 
terms  of  the  success  of  the  sales  campaign  and  how  it  will 
affect  profits,  while  on  the  other  hand  the  scientist  looks 
at  these  san;e  figures  and  seeks  a  clue  to  the  fundamental 
behavior  pattern  of  the  customers.  By  the  process  of  induc- 
tion he  tentatively  formulates  a  theoretical  system  or 
mechanism;  then  by  an  inverse  process  of  induction  he  can 
determine  what  phenomena  should  take  olace.   He  then  checks 
these  against  the  observed  facts.   His  test  is  simply  a  matter 
of  determining  if  quantitative  data  is  produced  such  as  can 
be  used,  for  predicting  how  the  customers  will  behave.   For 
example: 

In  a  company  manufacturing  specialty  products, 
examination  of  account  records  showed  that  customer 
behavior  could  be  accurately  described  as  a  time- 
dependent  Poisson  process — a  type  of  phenomenon  found 
widely  in  nature,  from  problems  in  biology  to  nuclear 
physics.   This  concept  yielded  the  key  to  establishing 
measures  of  the  efficiency  of  the  salesmen's  work  and 
of  the  effect  of  the  promotion  in  building  sales. 
On  this  basis  a  new  method  of  directing  promotional 
salesmen  to  appropriate  accounts  was  constructed-- 
and  then  tested  by  careful  experiments ,  to  see  if 
sales  increases  resulted  at  leee  than  proportionate 
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increases  in  cost  (the  results  in  this  case 
were  spectacular:   an  overall  sales  rise  in 
six  figures,  and  a  corresponding  gain  in  net 
profits).^ 

Basic  Concepts 

There  are  four  basic  concepts  important  to  the 
technique  of  operations  research.   These  are  (1)  the  model, 
(2)  the  measure  of  effectiveness,  (3)  the  necessity  for 
decision,  and  (4)  the  role  of  experimentation. 

The  first  concept  the  model  is  a  simplified  representa- 
tion of  an  operation,  containing  only  those  aspects  which  are 
of  primary  importance  to  the  problem  under  study.   The  model 
greatly  facilitates  the  investigations  of  operations.   The 
operations  research  analysts  uses  a  model  in  much  the  same 
manner  as  the  aeronautical  engineer  UM  a  model  airplane  to 
investigate  the  aerodynamic  properties  in  a  wind  tunnel;  or 
the  accountant  who  uses  the  accounting  model,  a  simplified 
representation  on  paper,  in  the  form  of  accounts  and  ledgers, 
of  the  flow  of  goods  and  services  through  a  business  enterprise, 
and  thus  measures  the  rate  of  flow,  the  values  produced,  and 
the  performance  achieved.   Many  models  are  also  used  in  physics. 
These  are  sets  of  mathematical  equations. 

Operations  research  models  rre  either  exact  or 
probabilistic:   (1)  The  exact  model  is  used  in  oases  where 
chance  plays  a  small  role  and  the  effect  of  a  given  action  is 
easily  and  reasonably  determined;  such  as  in  long  range 
production  scheduling  in  the  face  of  known  demand.   This  type 

4Ibld.,  p.  101,  ff. 
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of  model  Is  accurate  enough  so  that  except  for  some  very 
unusual  circumstance,  over  the  long  run  planned  and  actual 
production  will  be  reasonably  close.   (2)  The  probabilistic 
model  explicitly  recognizes  uncertainty  such  as  would  be 
the  case  in  advertising  where  unpredicability  of  the  consumer 
plays  a  great  role.   The  methods  developed  for  physical 
problems  involving  mass  behavior  under  random  conditions  can 
be  applied  easily  and  with  great  value  to  operations. 

The  second  basic  concept,  measure  of  effectiveness, 
is  a  measure  of  the  extent  that  the  operation  is  attaining 
its  goal.   One  measure  of  effectiveness  if  the  return  on 
investment  or  the  net  dollar  profit. 

The  third  concept  is  that  of  decision  making,  and  the 
existence  of  alternative  courses  of  action.  The  objective  of 
operations  research  is  to  clarify  the  relation  between  the 
several  courses  of  action,  and  to  determine  their  outcomes, 
and  to  decide  which  course  of  action  measures  up  best  in  terms 
of  the  company  goal. 

The  fourth  concept  concerns  the  role  of  experimentation. 
Operations  research  is  essentially  the  application  of  science 
to  the  study  of  operations.  The  theory,  or  model,  is  built 
up  from  observed  data  or  from  experience.   There  are  two  kinds 
of  experiments:   the  first  is  designed  simply  to  get  informa- 
tion, the  other  type  is  designed  to  test  the  validity  of 
conclusions. 

The  application  of  the  basic  concepts  to  management 
problems  present  some  areas  of  difficulty.   The  first  problem 
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is  the  choosing  of  the  initial  area  of  investigation, 
selecting  the  personnel  to  conduct  the  work  and  developing 
organizational  plans  for  future  growth.   In  commencing  an 
operations  research  study  or  investigation  there  should  be 
an  opportunity  for  decision  between  alternative  courses  of 
action.   The  problem  must  be  capable  of  quantitative  measure- 
ment and  study.  The  possibility  for  data  collection  must  be 
present,  and  the  possibility  of  evaluating  results  must  be 
present.  Management  and  this  applies  to  comptrollers  in  the 
military  have  two  alternatives  in  obtaining  operations 
research  people.   The  use  of  outside  consultants  may  be 
resorted  to  or  train  members  within  the  organization.   Both 
alternatives  have  their  advantages  and  disadvantages.   Some- 
times a  compromise  is  effective;  that  is,  having  an  outside 
group  assist  in  getting  an  internal  organization  started. 

Operations  research  should  only  be  undertaken  with  the 
intention  of  continuing  and  expanding  its  use.   The  investment 
in  knowledge  and  methodology  on  the  staff  is  too  valuable  a 
commodity  to  throw  away. 

Evaluation 

Case  histories  have  shown  that  the  techniques  of 
operations  research  provides  a  basis  for  objective  analysis 
of  operating  problems.   These  studies  are  quantitative  and 
provide  the  basis  for  sound  estimates  of  requirements,  objec- 
tives and  goals,  and  a  basis  for  planning  and  decision  making. 
Through  operations  research  there  has  been  a  technique  for 
the  application  of  organized  thinking  to  data  already  existing 
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wlthlr]  the  organization,  and  for  the  introduction  of  new 
concepts  and  new  methods  of  analysis. 

However,  we  must  make  this  observation.   Operations 
research  is  not  a  cure-all  for  every  management  ill,  nor  is 
it  a  source  of  automatic  decisions.   It  is  limited  to  the 
study  of  tangible,  measurable  factors.   The  many  factors 
affecting  management  decisions  that  are  intangible  and  quali- 
tative must  be  evaluated  on  the  basis  of  executive  judgment 
and  intuition. 

The  fact  that  operations  research  is  scientific  in 
character  rather  than  expert  means  that  more  time  is  normally 
required  to  arrive  at  any  solution  or  conclusion  than  in  the 
case  of  normal  engineering  analyses. 

In  conclusion,  the  future  of  operations  research 
appears  bright.   Successful  applications  in  many  fields  of 
endeavor  and  especially  industry  are  overcoming  the  skepticism 
of  many  people.   The  areas  of  application  of  the  science  is 
broad.   The  new  and  relatively  untested  theory  of  games  hold 
promise  for  the  development  of  strategic  concepts.   Operations 
research  can  and  will  live  up  to  its  expectations  of  helping 
executives  to  make  decisions. 


CHAPTER  VI 

CONCLUSION 

In  the  preceding  chapters  Me   have  discussed  statistical 
method  and  its  relation  to  the  comptroller  along  with  a  brief 
discussion  of  the  historical  background  concerning  the  develop- 
ment of  ma the mat leal  procedures  for  decision  making.   The 
statistical  methods  and  case  studies  were  of  necessity  brief, 
but  It  is  hoped  that  they  served  to  illustrate  the  various 
methods  of  application  in  solving  typical  management  problems. 
Perhaps  some  enthusiasm  has  been  engendered  for  the  science 
of  mathematical  decision  making.   A  word  of  caution  is  in 
order,  however,  these  mathematical  methods  and  especially 
those  based  on  probability  theory  are  only  a  whetstone  for 
common  sense  in  their  applications  to  problems  of  the  comptrol- 
ler and  executive  decision  making.   Administrative  methods 
already  in  use  within  the  organization  are  sharpened  with  this 
scientific  tool  which  enable  the  user  to  locate  and  to  make 
the  most  of  significant  factors  that  bear  on  the  problem  under 
analysis.   It  is  appropriate  to  emphasize  that  there  is  no 
substitute  for  knowledge,  not  necessarily  knowledge  derived 
from  books  or  a  college  education,  but,  empirical  knowledge 
of  one's  own  business.   For  example,  the  small  business  man 
is  better  able  to  predict  his  firm's  sales  for  the  coming 
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year  than  a  college  trained  statistician  who  ii  unfamiliar 

with  the  firm  or  industry.  The  statistical  formulas  presented 
in  this  ioaper  cannot  evaluate  nor  recognize  and  give  cognizance 
to  the  multitude  of  factors  which  bear  on  the  future.   This 
is  not  to  say  that  statistical  formulas  are  useless  when  a 
multitude  of  data  are  available.   On  the  contrary,  statistical 
methods  are  often  the  only  means  of  determining  some  order 
out  of  unordered  and  unrelated  facts.   It,  therefore,  appears 
evident  that  an  intelligent  forecast  based  upon  sound  analysis 
of  the  best  available  data  is  far  superior  to  a  forecast  based 
unon  hunches  and  rumors. 

How  Much  Statistics  Should  the  Comptroller 
Know? 
The  comptroller  if  he  is  to  have  an  adequate  apprecia- 
tion of  the  use  of  statistical  method  in  decision  making 
should  acquaint  himself  with  the  following  broad  categories 
of  the  statistical  field,  namely: 

1.  Sources  of  Information 

2.  Decision  Making  Tools 

3.  Forecasting 

4.  Administrative  Controls 

5.  Presentation  of  Data 
Sources  of  Information 

The  comptroller,  for  an  adequate  understanding  of 
this  nhase  should: 

1.   Understand  fully  the  problem  for  ^rhich  he 
seeks  information. 
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2.  Be  able  to  determine  what  kinds  of  data  are 
necessary  for  the  utilization  of  decision  processes. 

3.  Ee  aware  of  the  limitations  of  data  obtained 
within  his  own  organisation. 

4.  Be  familiar  with  listings  of  source  material  such 
as  specialized  governmental  and  private  publications. 
Decision  -.Making;  Tools 

The  comptroller  should  develop  an  adequate  vocabulary 
in  the  field  of  statistical  decision  theory  in  order  that  he 
may  communicate  more  effectively  with  his  statistical  staff. 
He  should  understand  the  appropriateness  cf  the  methods  being 
employed  in  solving  a  particular  nroblem.   The  comptroller 
should  have  some  understanding  of  the  formulas  involved  in  the 
process,  and  more  important,  of  the  tine  involved  in  the 
application  of  these  formulas.  He  should  be  familiar  with 
modern  computing  techniques  in  order  that  he  n?ay  know  where 
to  turn  for  help  and  guidance. 
Forecasting 

Planning  is  a  fundamental  oroblem  cf  the  comptroller, 
and  planning  involves  projection  into  the  future.  All  fore- 
casting involves  some  degree  of  uncertainty.  With  adequate 
statistical  knowledge,  the  comptroller  can  reduce  his  "uncert- 
ain stabbing"  decisions  and  base  each  more  on  a  sound 
scientific  basis  with  established  and  predetermined  risks. 

A  good  example  of  the  problem  encountered  in  fore- 
casting is  given  by  Mark  Twain  when  he  summed  up  the  nonsense 
side  of  extrapolation  in  his  book  "Life  on  the  Mississippi;" 
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In  tire  snace  of  one  hundred  and.  seventy  six  years 
the  lower  Mississippi  has  shortened  itself  two 
hundred  and  forty-two  miles.   That  is  an  average  of 
a  trifle  over  one  mile  and  a  third  per  year.   Therefore, 
any  calm  person,  who  is  not  "blind  or  idiotic,  can  see 
that  in  the  old  oolitic  oilurian  period  just  a  million 
years  ago  next  November,  the  lower  Mississippi  river 
was  upward  of  one  million  three  hundred  thousand  miles 
long,  and  stuck  out  over  the  Gulf  of  Mexico  like  a 
fishing  rod.  And  by  the  same  token  any  person  can  see 
that  seven  hundred  and  forty-two  years  from  now  the 
lower  Mississippi  will  be  only  a  mile  and  three  quarters 
long,  and  Cairo  (Illinois)  and  New  Orleans  will  have 
joined  their  streets  together,  and  be  plodding 
comfortably  along  under  a  single  mayor  and  mutual 
board  of  aldermen.  There  is  something  fascinating 
about  science.   One  gets  such  wholesale  returns  of 
conjecture  out  of  such  a  trifling  investment  of  fact. 

Administrative  Controls 

Concerning  administrative  controls  it  is  a  well  known 
» 
fact  that  these  controls  are  of  little  value  if  they  are  not 

carried  out.   The  comptroller  must  have  sufficient  knowledge 
of  statistical  control  techniques  to  be  able  to  interpret 
changes  in  control  on  an  objective  basis.   He  must  have  an 
understanding  of  the  role  played  by  probability  in  the  control 
situation,  and  be  able  to  evaluate  departures  from  control 
and  to  take  appropriate  action  when  control  is  violated. 
Presentation  of  Data 

The  best  techniques  in  decision  making,  forecasting 
and  control,  are  of  little  value  if  the  results  are  not 
adequately  presented.   The  comptroller  should  select  tabular 
and  graphic  devices  which  most  effectively  communicate  the 
desired  message.   To  illustrate  this  point  an  editorial  writer 
of  a  financial  magazine  reproduced  a  chart  from  an  advertise- 
ment in  Washington,  D.  C.   The  argument  stated  Government 
Payrolls  Up.   Figures  on  the  graph  showed  an  increase  from 
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19.5  rnllllon  to  20.2  million.   By  rescaling  the  graph  an 
entirely  different  version  was  given  by  this  magazine 
Government  Payrolls  stable! 


To  summarize,  it  can  be  seen  that  the  well  versed 
decision  making  comptroller  must  have  a  vast  fund  of  knowledge 
available  to  him.   Much  of  this  knowledge  empirical  from  a 
particular  field.   The  comptroller  must  have  the  knowledge  at 
his  finger  tips  or  know  where  to  get  it.  He  need  not  be 
proficient  in  the  deep,  mystical,  and  theoretical  foundations 
of  statistics;  however,  he  needs  to  know  enough  of  the  surface 
material  not  to  be  misled  or  confused.   The  comptroller  must 
trust  the  foundations  on  which  his  decision-making  processes 
are  based,  for  then  only  will  he  have  faith  in  the  course 
of  action  he  chooses  to  take. 
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VI  APPENDIX 

A   TABLES 

Table  I 

Selected  Normal  Probabilities 

z 

%   Area 

.126 

.05 

.253 

.10 

.385 

•15 

.524 

.20 

.6745 

•25 

.842 

.30 

1.036 

.35 

1.282 

.40 

1*645 

.45 

1.960 

.475 

2.326 

.49 

2.576 

.405 

3.090 

.499 
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TABLE   II 

XXXXXX  (continued) .    Confidence  coefficient,  I  -  2a  «■  0-09. 


.  06' 


cfn 


*<*     «-<»     ov«     t-M     hi     »i4    o-u     ».w     o-a»     on     *•!»«     o-» 


O-M      0-«0       »■«       »M       0-36       0  M       ttt       o«J       0*4       M*       V*» 


d*. 


T         The  numbers  printed  al  .4  the  curves  indicate  the  sample  sice  n. 


Note:  the  process  of  reading  from  the  curves  can  be  simplified  with  the  help  of  the  right-angled  corner  of  a  loose 
sheet  of  paper  or  thin  card,  along  the  edges  of  which  are  marked  off  the  scales  shown  in  the  top  left-hand  corner  of 
each  Chart. 
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* 


Thie  table  \a  reproduced  from  BBr«e»triki:  liable  a  Tor  Statiatiaiaae*  Ibl.  I 
{195k),  edited  by  X.   S.  Fearaon  end  H.   0;  Hartley,  by  peradseioa  of 
Profeeaor  I*  S.  Feareon. 
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TABLE   III 

XXX3CXSC   Chart  providing  cxrnfide.net  Units  for  p  in  binomial  sampling,  given  a  sample  fraction  s/n. 
„ . ,_  Confide  xe  coefficient,  1  —  2a  =  0-95. 


\-m  /.to 


The  numbers  printed  Along  the  curves  indicate  the  sample  size  n.    If  for  a  given  value  of  the  abscissa  e/n,  pA  and  pf 
'are  the  ordinate*  read  from  (or  intar{>olatcd  betwoon)  tho  appropriate  lower  and  uppor  ourvM,  then 

p  Vr{pA*p<pB)<\-2*. 

'  204 

This  tafcle  I*  reproduced  from  E.  S.  Pearson  and  H.  0.  lartley,  "Bicswtrika 
Taales  for  Statisticians",  Tol  I  (195*),  *7  permission  of  Professor 

E.  S.  Paarson. 
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TABLE   IV 

xxxxxx  Chart  giving  confidence  limits  for  the  population  correlation  coefficient,  p,  given 

the  sample  coefficitri,  r.  Confidence  coefficient,  I  —  2a  —  0-95 


■M 


-7         -»         =T^ 


-^?        Scale  of  r  (sample  correlation  coefficient) 


<      ♦  •>      +■ 


V 


7         *-t         *  -9       *  .-0 


".*? 


The  Humbert  on  the  ourvea  indicate  sample  size.  The  chart  eau  also  be  used  to  determine  upper  and 

lower  2- 6  %  significance fl§k^k£Qf ■*,  given  p. 


0« 


This  tabid  Is  reproduced  frqn  "Brometrika  Tables  For  Statisticians* 
Jbl.  I(lg$li),  adited  by  E^6.   Pearson  a*l  Ho   0.   Hartley,  by 

pertrJtsi3N>R  of  Professor  E.  S»  Pearson, 
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TABLE  V 
T3&X&X1X  {continued).    Confidence  coefficient,  1  —  2a  =  0-99 


f^O1'0      ~'9  ~'7  -•»-•« 


•I +-Z        +  •*        +•«        +"*        +-6        +7        +•«        +•?        +1-0 


+  •9 


+  •5         +-6         +-7         +-8         +-9       +1-0 


Scale  of  r  (sample  correlation  coefficient) 

The  numbers  on  the  curves  indicate  sample  size.  The  chart  can  aNo  be  used  to  determine  upper  and 

lower  0-6  %  significance  points  for  r,  given  p. 


This  table  la  reproduced  from   RBrometrika ..Tables  For  Statistic! ana" 
Vol.  I  (19$k)»   edited  by  E.  S.  Pearson  unci  H.  0.  Hartley,,  by  permission 
f  Professor  £.  S.  Pearson* 
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APPENDIX  B 
Directions  for  Computing  the  Mean  and  Standard 
Deviation  From  a  Frequency  Distribution 


Distribution: 

X 

:   f   : 

fX   , 

,  d 

fd 

:   fd2  A 

60-64   : 

62 

:   3   : 

186 

:  -3   I 

-9 

:   27  : 

65-69   : 

67 

:   8   : 

536 

i  -2 

-16 

:   32   : 

70-74  : 

72 

:  17   : 

1224   , 

:   -1 

i  -17 

:   17  : 

75-79  : 

77 

1  27  ! 

2079   : 

:   0   : 

0 

:    0   ; 

80-84  : 

82 

:  25   : 

2050   ! 

+1   l 

I   +25 

:   25  : 

85-89  : 

87 

:  20   : 

1740   ! 

+2   j 

+40 

:  80   : 

Totals      : 

:  100  : 

7815 

+23 

:  181   : 

X  =  The  mid  point  between  the  numbers  in  the  distribution. 

f  =  The  frequency  or  number  times  these  observations  occur 
in  the  distribution. 

fX  sb  Frequency  times  the  mid  point. 

d  =  Deviations  from  the  mean. 

fd  m   Frequency  times  the  deviations  from  the  mean. 

fd2  =  d  times  fd. 

Z  =  Means  to  sum  or  add  up. 
To  compute  the  mean: 


Meaa  = 


2fX 


7815 


n 


100 


X  ■  78.15  X  =  Mean,   n  =  number  of  observations 

in  the  distribution 
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APPENDIX  B  (CONTINUED) 

To  compute  the  Standard  Deviation: 

S  =  Standard  Deviation  and  i  a  the  class  interval  of  the 
distribution  60  to  65  to  70  etc. 


fd 


H2-) 


181 

S  =  5     \/        

100 


V   lo°    7 


S  =  5  \/     1.81   -    .0529  s  =  6.625 

S  =  5  \/     1.757100 


/ 
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APPENDIX  C 
Directions  for  Solving  Problems  by  the 
Transportation  Problem  Procedure 


This  sample  consists  of  assigning  the  manner  of 
shipping  railroad  empties  from  three  shipping  points  having 
excess  number  of  empties  to  five  destinations  having,  in 
total,  the  same  number  of  freight  car  shortages.   The  data 
for  the  problem  is  given  in  Table  A  below;  that  is,  the 
shipping  rates  from  each  shipping  point  to  each  destination, 
the  excesses  at  each  shipping  point,  and  the  shortages  at 
each  destination. 

Surplus 
(A)         Dl     D2     D3     D4     D5      Totals 


SI 

$10 

20 

5 

9 

10 

9 

S2 

2 

10 

8 

30 

6 

4 

S3 

1 

20 

7 

10 

4 

8 

Shortage 

3 

5 

4 

6 

3 

21 

Totals 

It  is  first  necessary  to  get  a  feasible  solution,  that 
is,  one  which  meets  the  fixed  requirements  (of  excesses  and 
shortages)  regardless  of  cost.   This  solution  may  be  obtained 
as  follows.  Take  the  9  cars  at  shipping  point  SI  and  fill 
the  shortages  of  3  cars  at  Dl  and  5  at  D2;  put  the  remaining 
car  at  D3«   Take  the  4  cars  at  32  and  put  3  more  at  D3  to 
complete  the  shortage  there  of  4,  and  put  the  fourth  car  at 
D4.  Take  the  8  cars  at  S3,  put  5  at  D4  to  complete  the 
shortage  there  of  6  and  put  the  remaining  3  at  D5,  which 
fills  the  shortage  there  of  3.   The  procedure  could  obviously 
be  used  to  fill  a  table  of  any  size.   This  solution  is  entirely 
feasible  and  would  cost  $351» 

(3  x  10  +  5  x  ^20  +  1  x  |5  +  3  x  |8  +  1  x  f30  +  5  x  |10  +  3 
x  |4  =  |35D 
The  table  would  then  be  shown  in  Table  B. 

(B)  Dl     D2     D3     D4     D5 

51  3      5      1  9 

52  3      14 

53         5      3 8_ 

3     5     4     6     3      21 

Since  we  learned  in  Chapter  4  that  the  method  we  are 
using  leads  to  the  optimum  solution  by  proceeding  through  a 
series  of  solutions,  each  closer  to  the  optimum  than  its 
predecessor,  and  never  repeating,  it  is  obvious  that  we  may 
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save  ourselves  a  number  of  computational  steps  If  we  obtain 
a  first  solution  which  is  the  most  economical  we  can  get  by 
quick  inspection.   Stated  differently,  if  we  use  common  sense 
to  get  a  starting  point  which  is  fairly  inexpensive  in  terms 
of  shipping  costs,  we  will  save  ourselves  a  great  deal  of 
computational  effort,  even  in  a  problem  as  short  and  simple 
as  the  example  we  are  using  for  Instruction  purposes.   There- 
fore let  us  attack  the  problem  as  follows.  Take  any  ship- 
ping point  at  all  and  use  the  capacity  (number  of  empties) 
there  to  fill  those  destination  requirements  (shortages)  which 
by  the  cost  of  shipping  it  seems  most  economical  to  fill. 
V»hen  that  capacity  is  exhausted,  take  another  shipping  point 
and  do  the  same,  first  filling  any  requirements  that  were  only 
partially  satisfied  by  the  preceding  shipping  point.   Using 
this  procedure  we  can  proceed  as  follows.  Take  the  8  cars  at 
83  and  put  3  at  Dl  at  $1  each;  put  3  of  the  remaining  at  D3 
at  §4  each;  and  the  other  2  at  D3  at  #7  each.  Fill  03*8 
remaining  requirements  with  2  cars  from  52,  and  put  the  remain- 
ing 2  from  S2  at  D2  (both  Dl  and  D5  cost  less  but  are  already 
full  from  the  shipment  from  ZJ>) .   Finally,  complete  the  table 
by  placing  the  9  cars  from  SI,  3  at  D2  and  6  at  D4.   live  then 
have  Table  C,  at  a  cost  of  #179 1  which  is  considerably  better 
than  our  random  solution  of  Table  B,  and  therefore  offers  us 
fewer  steps  to  go  through  to  achieve  the  optimum  solution. 


(O 

SI 

S2 


Dl 


D2 

3 

2 


D3 


2 

2 


D4 


D5 


9 
4 

8 


It  is  next  necessary  to  build  up  a  type  of  cost  table, 
First,  fill  in  a  table  showing  the  actual  shipping  costs, 
taken  from  Table  A,  for  those  shipments  which  are  actually  in 
use  in  Table  C.  This  gives  Table  D.  We  can  refer  to  the 
"squares"  of  the  table  as  "square  S1D1"  etc.,  and  in  Table  D, 
S1D4  is  equal  to  $9,  or  S1D4  =  9. 


(D) 

SI 
S2 
S3 


Dl 


D2 

20 
10 


D3 


8 


D4 
9 


D5 


Second,  in  building  up  the  cost  table,  of  which  Table 
D  is  the  first  step,  assign  row  and  column  values  on  an 
arbitrary  basis;  row  and  column  values  are  the  values  shown 
under  those  headings  in  Table  S.   They  are  obtained  by  assigning 
a  value  to  row  SI,  such  as  zero,  or  1,  or  2,  or  anything  (we 
have  chosen  zero)  and  then  under  every  square  of  row  SI  which 
contains  a  rate  (S1D2,  S1D4,  etc.)  assign  a  column  value  such 
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that  the  sum  of  the  row  and  the  column  values  equals  the  sum 
of  the  value  In  the  table.   Thus,  we  obtain  Table  E,  in  which 
S1D2  is  20,  and  since  the  row  value  was  zero,  the  column  value 
must  be  20;  G1D4  is  9  and  therefore  the  column  value  under  D4 
must  be  9  so  that  the  row  value  zero  and  the  column  value  9 
equal  the  square  S1D4.   It  should  be  noted  that  both  positive 
and  negative  values  of  row  and  column  values  may  occur.   In 
observing  C2D2  we  see  that  we  already  have  a  column  value  of 
20,  and  that  the  square  value  is  10,  and  therefore  the  row 
value  for  S2  must  be  -10.     then  proceed  to  determine  the 
column  value  for  D3,  since  we  have  a  row  value  of  -10  and  a 
square  value  of  8  for  S2D3;  the  column  value  is  necessarily  18. 
Table  E  is  completed  by  similar  steps  to  give  the  values  shown. 


(E) 


Dl 


D2 


D3 


D4 


D5 


Row  Values 


SI 

S2 


20 
10 


8 


0 
-10 

-11 


Column  12 
Values 


20 


18 


15 


Finally,  we  make  Table  E  into  a  cost  table,  Table  F, 
by  filling  in  all  the  blank  squares  with  the  sum  of  the 
developed  row  and  column  values  as  they  are  shown  in  Table  E. 
S1D1  then  becomes  12,  the  sum  of  the  SI  value  of  Zero  and  the 
Dl  value  of  12,  etc. 


(F) 

SI 
02 

§2. 


Column 
Values 


Dl  D2     D3     D4  D5     Row  Values 

12  20     18      9  15         0 

2  10      8-1  5        -10 

i    9 2_ -2  * ill... 


12 


20 


13 


15 


As  one  develops  familiarity  with  the  method,  he  omits 
the  majority  of  the  tables  which  have  been  added  for 
instructional  purposes.  All  that  is  needed  is  a  rate  table, 
Table  A;  a  route  table,  Table  C;  and  a  cost  table,  Table  F, 
for  the  first  stage  of  the  method. 

It  is  now  possible  to  determine  what  change  In  our 
route  of  Table  C  should  be  made  to  reduce  our  shipping  cost. 
Examine  the  table  In  comparison  to  the  rate  table,  and  find 
the  square  where  the  figure  in  Table  F  is  larger  by  the 
greatest  amount  than  the  figure  in  Table  A.  By  inspection 
this  is  square  S1D3  in  which  we  find  a  value  of  18  compared 
to  Table  A's  value  of  5.   We  now  know  that  a  change  in  the 
route  so  that  more  cars  go  from  SI  to  D3  we  will  save  |13  on 
each  additional  car  that  we  can  put  on  this  route.   A  detailed 
explanation  of  the  reason  for  this  statement  may  be  found  in 
Henderson  and  Sehlaifer's  article.   The  problem  is  now  to  find 
out  how  many  cars  we  can  shift  to  square  D1D3,  i.e.,  how  many 
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cars  we  can  send  from  shipping  point  SI  to  destination  D3 
instead  of  using  the  solution  of  Table  C.  Me   do  so  as  follows. 
Construct  Table  G,  and  in  square  S1D3  write  +x:   this  is  the 
unknown  amount  which  will  be  shipped  from  SI  to  D3.  The  other 
amounts  in  Table  G  are  obtained  by  copying  Table  C. 

(G)     Dl     D2     D3     D4     D5 

51  3     +x     6*  9 

52  2      2  4 

SJ 2£ 2 2* 8 


But  we  must  know  which  routes  are  unaffected  if  we  are 
to  complete  the  table  properly.   We  mark  those  squares  in 
Table  G,  after  copying  the  Table  C  values,  with  an  *  or  some 
mark,  which  is  the  only  number  in  either  its  row  or  its  column. 
The  +x  in  S1D3  is  counted  as  a  number  for  this  purpose.  Hence 
we  mark  the  value  S3D1  with  an  *,  since  3  is  the  only  number  in 
the  row  Dl;  and  we  mark  the  6  of  S1D4  and  the  3  of  S3D5  with 
an  *•   We  next  consider  the  *  numbers  as  not  in  the  table,  and 
go  through  the  *  procedure  again,  looking  for  any  numbers  which 
are  now  alone  in  either  rows  or  columns .  Thus  we  *  square 
S3D3>  where  the  2  is  the  only  number  meeting  the  requirement 
of  being  alone,  in  this  case  in  row  S3.   We  cannot  *  any  more 
numbers.  We  now  have  Table  H,  except  that  the  only  square  with 
an  x  value  in  it  is  the  +x  in  S1D3.   We  complete  Table  H  by 
disregarding  all  the  *  values  and  filling  in  +x  and  -x  values 
such  that  the  rows  and  columns  add  up  properly  to  the  totals 
shown,  S1D2  must  necessarily  become  3-x  so  that  the  sum  of  the 
three  values  of  row  21  (3-x,  +x,  6)  add  up  to  the  known  total  9# 

(H)     Dl     D2     D3     D4     D5 

51  3-x     +x     6*  9 

52  2+x    2-x  4 

sj 2* 2f 2* *L 


6 


The  maximum  number  of  cars  we  can  divert  to  C1D3  is 
then  2,  for  if  we  diverted  any  more  we  should  have  a  minus 
number  of  cars  for  square  S2D3,  where  our  new  value  is  2-x; 
one  of  the  postulates  of  our  problem  is  of  course  that  we  do 
not  want  negative  shipments,  i.e.,  shipments  from  destinations 
to  shipping  points.   This  restraint  is  normally  referred  to  as 
a  "non-negativity"  restraint.  By  substituting  the  value  of  2 
for  x,  we  obtain  Table  J  (Table  I  omitted)  and  can  determine 
that  the  shipping  cost  is  now  only  <;154,  or  an  improvement  over 
our  value  of  ?179;  and  as  we  said,  we  save  2  cars  at  $13  each, 
or  $26  over  the  first  solution,  or  ''179  -  "?6  -   $154. 


(J) 

£1 
82 

Si. 


Dl 


D2 
1 


100 
D3     D4 
2      6 
2 


D5 


9 

4 

8 


3 


4 


It  is  now  necessary  to  repeat  the  same  procedure, 
beginning  with  Table  J,  as  we  have  just  done  in  beginning  with 
Table  C.  Tables  K  and  L  are  intermediate  steps  to  obtaining 
Table  M.   In  Table  K,  a  row  value  of  10  for  row  SI  was  taken, 
and  the  complete  table  developed  Just  as  was  done  for  Table  F. 
Comparison  of  Table  K  with  Table  A  shows  that  S3D2  in  the 
former  is  higher  in  value  by  2,  and  therefore  that  we  will 
save  |2  for  each  car  that  we  can  shin  over1  that  route  if  we 
make  the  proper  adjustments  in  the  other  routes.   Table  L, 
developed  just  as  was  Table  H,  shows  that  we  can  ship  1  car 
over  route  £3D2.   Our  new  routing  table  is  then  Table  ft,  and 
its  cost  is  fl51»  or  |2  less  than  our  previous  routing  Table  J. 


(K) 

SI 

S2 

S3 


Dl 

-1 
•11 


D2 

20 
10 
22+2 


D3 

5 
-5 


D4 

9 
-1 

11+1 


D5 

2 

-8 

4 


10 

0 

12 


-11 


10 


-5 


-1 


-8 


(L) 

31 
32 

S2L 


01 


.2* 


D2 

1-x 
4* 

+x 


D3 

2+x 

2-x 


D4 
6* 


"35 


21 


9 
4 
8  x  =  1 


(M) 

£1 

S2 

S3L 


Dl 


i 


D2 


D3 
3 
1 


D4 
6 


D5 


--■ 

2 


9 
4 

8 


Once  more  we  go  through  the  procedure,  obtaining 
Tables  N  and  0  in  reaching  Table  P,  and  improve  our  cost 
to  |150« 
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(N)     Dl     D2     D3     D4     D5 


SI 

-1 

18 

5 

9 

2 

9 

S2 

-9 

10 

-3 

1 

-6 

1 

S3 

1 

20 

7 

11+1 

4 

11 

-10 

9 

-4 

0 

-7 

(0) 

Dl 

D2 

D3 

D4 

D5 

SI 

3+x 

6-x 

S2 

4# 

x=l 

SB 

3* 

1* 

1-x 

+x 

3* 

(p) 

Dl 

D2 

D3 

D4 

D5 

51 

4 

5 

S2 

4 

S2 

...  ,  5 

1 

1 

.„? 

Another  trial,  giving  Table  Q,  shows  that  in  no  caee 
is  there  any  value  In  Table  Q  which  is  higher  than  its 
corresponding  square  in  Table  A.    t   know  then  that  we  have, 
in  Table  ?,  reached  the  optimum  arrangement.  There  remain  no 
changes  we  can  make  in  that  tf.ble  to  obtain  a  lower  cost  of 
making  the  shipments  of  empty  freight  cars  that  Table  A  gave 
as  being  required.  We  have  given  the  freight  car  dispatcher 
the  information  on  which  to  make  his  decision;  should  he 
decide,  for  reasons  best  known  to  him,  to  make  a  shipment 
other  than  Table  P  shows,  he  can  compute  the  excess  cost  to 
do  so,  and  base  his  decision  on  concrete  cost  data,  not 
conjecture. 

(Q)     Dl     D2     D3     D4     D5 


SI 

0 

19 

5 

9 

3 

5 

S2 

-9 

10 

-A 

0 

-6 

-4 

83 

1 

20 

0 

10 

4 

6 

-5     14      0      4     -2 
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GLOSSARY 


Arithmetic  mean  -  the  simple  average  of  a  set  of  observations. 
It  is  obtained  by  adding  a  set  of  observations  and 
dividing  by  the  number  of  observations. 

Average  (mean)  deviation  -  the  simple  average  of  the  absolute 
distances  of  the  observations  from  the  mean. 

Binomial  distribution  -  a  distribution  in  which  there  are  only 
two  alternatives.   Sometimes  referred  to  as  a  dichotomous 
distribution.   For  example,  light  bulbs  are  defective 
or  good. 

Binomial  expansion  -  rsising  the  binomial  to  a  given  power. 

Binomial  theorem  -  the  theorem  by  means  of  which  a  binomial 
may  be  raised  to  any  power  without  performing  the 
multiplications.   A  binomial  is  an  expression  consisting 
of  two  terms  connected  hj   a  plus  or  a  minus  sign.   A 
binomial  distribution  is  a  distribution  where  there  are 
only  two  alternatives.   For  example  in  sampling 
light  bulbs,  they  are  either  good  or  bad. 

Coefficient  of  variation  -  a  measure  indicating  the  percent 
the  standard  deviation  is  of  the  arithmetic  mean. 

Correlation  coefficient  -  an  index  measuring  the  extent  of 
relationship  between  two  variables.   It  ranges  from  -1 
(a  perfect  negative  relationship)  to  1  (a  perfect  positive 
relationship) . 

Estimation  -  that  phase  of  inference  in  which  sample  character- 
istics are  used  to  provide  estimates  of  corresponding 
population  characteristics. 

Hypothesis  -  a  statement  of  fact  which  it  is  desired  to  test. 
Usually  referred  to  as  a  null  hypothesis  when  testing  • 
specified  characteristic  of  a  population. 

Inter-quartile  range  -  the  middle  50%   of  the  observations. 
Those  two  values  which  include  the  middle  $0%   of  the 
cases  in  a  distribution. 

Median  -  The  middle  observation  When  a  set  of  data  have  been 
ordered  or  ranked . 

Mode  -  that  observation  in  a  distribution  which  occurs  more 
than  any  other. 
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Normal  curve  -  a  symmetrical  bell  shaped  curve  possessing 
certain  statistical  characteristics. 

Normal  distribution  -  often  referred  to  as  Ga as  si- n.   A  bell 
shaped  curve  possessing  certain  statistical  character- 
istics. 

Parameter  -  a  characteristic  in  a  population*   For  example, 
the  true  proportion  of  Jefectives  in  a  lot  of  100,000 
shells. 

Pols son  distribution  -  a  skewed  distribution  which  is 
encountered  frequently  in  supply  problems. 

Population  -  sometimes  referred  to  as  universe.   The  entire 

group  or  set  of  observations  in  which  we  have  an  Interest, 

Proportion  -  a  measure  between  0  and  1,00  indicating  the  ratio 
of  the  number  possessing  a  given  characteristic  to  the 
total  number  of  observations.   For  example,  a  proportion 
of  .80  means  that  80^  of  the  observations  possess  a 
certain  characteristic. 

Random  -  a  procedure  whereby  only  chance  influences  selection. 
Not  synonymous  with  haphazard. 

Range  -  the  distance  between  the  largest  and  smallest 
observation  in  a  set  of  observations. 

Regression  -  a  study  of  the  predictive  value  of  a  line  in 
explaining  the  degree  of  relationship  between  two 
variables. 

Regression  coefficient  -  the  average  amount  of  change  in  Y 
per  unit  change  in  X. 

Sample  -  a  part,  or  sub-set,  of  a  population  or  universe. 

Skewness  -  a  departure  from  symmetry. 

Standard  deviation  -  the  most  reliable  measure  of  variation. 
Often  referred  to  as  the  root-mean-square. 

Standard  error  of  estimate  -  a  measure  of  the  scatter  about 
the  line  of  regression.  The  standard  deviation  of  the 
observed  ooints  about  the  line. 

Standard  error  of  the  mean  -  the  standard  deviation  of  sample 
means.   A  measure  of  the  variability  in  a  sample  mean. 
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Standard  score  -  the  nuaber  of  standard  deviations  an  observa- 
tion is  from  the  arithmetic  mean.   For  example,  a 
standard  seore  of  -8  indicates  that  the  observation  1b 

two  standard  deviations  below  the  wean. 

Statistic  -  s  characteristic  in  a  sample.   For  example,  the 
proportion  of  defectives  in  •  sample  of  100  shells. 

Statistics  -  that  branch  of  applied  mathematics  which  deals 

with  experimental  design  and  the  collection,  tabulation, 
analysis |  intarpratation,  and  praaantation  of  data. 

T  scale  -  a  theoretical  distribution  that  goes  by  the  name  of 
Student- 1  distribution ,  d •▼sloped  by  '.*.  3.  Gtossst  who 
published  his  work,  under  the  pen  name  of  Student. 

Theory  of  least  squares  -  mathematically  ooaputed  lines  nay 

be  passel  through  the  data,  or  toatter  points,  which  are 
called  regression  lines,  or  lines  of  Average   relatio 
shin,  because  they  reveal  the  typisal  change  in  the 
dependent  variable  Y  which  has,  in  the  past,  accompanied 
a  given  change  in  the  variable  plotted  on  the  X  axis. 
This  average  relationship  may  be  determined  mathematically 
by  the  method  of  least  squares, 

Type  I  error  -  rejecting  B  hypothesis  when  in  fact  it  is  true. 

Tyne  IT  error  -  soe opting  a  hypothesis  when  in  fact  it  is 
false* 

Variable  -  that  which  does  not  maintain  a  constant  value. 
The  existencs  of  variables  brings  forth  the  need  for 
statistics. 


